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Sequences producing significant alignments: 
(Click headers to sort columns) 



Transcripts 



NM_003107 


2 


Homo sapiens SRY 
4 (S0X4), mRNA 




determining region 


Y) 


-box 


5012 


5012 


99% 
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99% 




NM_003108 
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Homo sapiens SRY 
11 (SOXll), mRNA 




determining region 
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-box 
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440 
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2 


Homo sapiens SRY 




determining region 


Y) 


-box 


291 


422 


12% 
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12 (S0X12), mRNA 






















NM_006941 




Homo sapiens SRY 
10 (SOXIO), mRNA 




determining region 


Y) 


-box 


167 


167 


7% 


7e-38 


77% 




NM_000346 


2 


Homo sapiens SRY (sex determining region 
9 (campomelic dysplasia, autosomal sex- 
reversal) (S0X9), mRNA 


Y) 


-box 


156 


156 


7% 


le-34 






NM_007084 


2 


Homo sapiens SRY 
21 (S0X21), mRNA 




determining region 


Y) 


-box 


140 


140 


7% 


le-29 


74% 
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Homo sapiens SRY 
15 (S0X15), mRNA 




determining region 




=b5r 


T29~~~ 


~129~~ 


■~7%~~" 


2e-26 


74% 




NM_014587 


2 


Homo sapiens SRY 
8 (S0X8), mRNA 




determining region 


Y) 


-box 


127 


127 


8% 


7e-26 


71% 




NM_031439 


2 


Homo sapiens SRY 
7 (S0X7), mRNA 




determining region 


Y) 


-box" 


113 


113 


7% 


le-21 


71% 


SB 


NM_018419 


2 


Homo sapiens SRY 
18 (S0X18), mRNA 




determining region 


Y) 


=bSr 


107 


107 


7% 


6e-20 


71% 




NM_004189 


2 


Homo sapiens SRY 
14 (S0X14), mRNA 




determining region 


Y) 


:^^b5r 


104 


104 


7% 


8e-19 


71% 




NM_022454 


3 


Homo sapiens SRY 
17 (S0X17), mRNA 




determining region 


Y) 


-box 


100 


100 


7% 


9e-18 


71% 
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NM_00 5634 


2 


Homo sapiens SRY 
3 (S0X3), mRNA 




determining region 


Y) 




100 


100 


7% 


9e-18 


"71% 




NM_005986 


2 


Homo sapiens SRY 
1 (SOXl), mRNA 




determining region 


Y) 




96.9 


96.9 


7% 


le-16 


70% 




NM_003106 


2 


Homo sapiens SRY 
2 (S0X2), mRNA 




determining region 


Y) 




91.5 


91.5 


7% 


5e-15 


69% 


SB 


NM_005686 


2 


Homo sapiens SRY 
13 (S0X13), mRNA 




determining region 


Y) 




78.8 


78.8 


7% 


3e-ll 


70% 




XM_001713691.1 


PREDICTED: Homo sapiens hypothetical protei 
LOC100130809 (LOC100130809 ) , mRNA 




64.4 


64.4 


4% 


7e-07 


74% 


NM_003140 


1 


Homo sapiens sex 
mRNA 


determining region Y (SRY), 


64.4 


64.4 


4% 


7e-07 


74% 




NM_152989 


2 


Homo sapiens SRY (sex 
5 (S0X5) , transcript ^ 


determining region 
/ariant 2, mRNA 


Y) 




57.2 


57.2 


3% 


le-04 


-jj^ 


Eta 


NM_006940 


4 


Homo sapiens SRY 
5 (S0X5), transci 


(sex 
ipt . 


determining region 
/ariant 1, mRNA 


Y) 




57.2 


57.2 


3% 


le-04 


73% 


1 


Niyi_178010 


1 


Homo sapiens SRY 
5 (SOX5), transci 


(sex 


determining region 
/ariant 3, mRNA 


Y) 




57.2 


57.2 


3% 


le-04 






NM_0528 76 


2 


Homo sapiens BTB 
(BTBD14B), mRNA 


(POZ 


domain containing 


14B 


51.8 


51.8 


1% 


0.004 


89% 




NlC.033326" 


"2 


Homo sapiens SRY 
6 (S0X6), transci 


ipt ^ 


determining region 
/ariant 2, mRNA 


Y) 




50.0 


50.0 


3% 


0.014 
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NM_017508 


1 


Homo sapiens SRY 
6 (SOX6), transci 




determining region 
/ariant 1, mRNA 


Y) 




50.0 


50.0 


3% 


0.014 
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din 46.4 46.4 1% 0.18 88% 



NM_001040001. 1 Homo sapiens myeloid/lymphoid or mixsd-lineage 46.4 46.4 1% 0.18 88^ 
leukemia (trithorax homolog, Drosophila) ; 
translocated to, 4 (MLLT4), transcript variant 
1 , mRNA 



NM_001040000. 1 Homo sapiens myeloid/lymphoid or mixed-lineage 46.4 46.4 1% 0.18 88% 
leukemia (trithorax homolog, Drosophila); 
translocated to, 4 (MLLT4), transcript variant 
2 , mRNA 

"n"mI"00'5936''.''2 Homo'''sapiens''myeloid^ 46""4 46"."4 1% o'.'l's"" 8 8% 

leukemia (trithorax homolog, Drosophila) ; 
translocated to, 4 (MLLT4) , transcript variant 



NmI015156"."2"' 



""""Homo""sapiens""RE^^^ 44"."6 44"6 1% o"."e 



Genomic sequences [show first] 



NT_ 


.007592.14 




Homo sapiens chromosome 6 
reference assembly 


genomic ( 




5012 


5094 


99% 


0.0 




99% 


NW. 


.001838973. 


.1 


Homo sapiens chromosome 6 
alternate assembly (based 
SCAF_1103279188126) 


genomic ( 
on HuRef 




1570 


3057 


60% 


0.0 




100% 


NW. 


.001838765. 


,1 


Homo sapiens chromosome 2 
alternate assembly (based 
SCAF_1103279188139) 


genomic ( 
on HuRef 




309 


440 


13% 


le- 


■80 


86% 


NT_ 


.005334.15 




Homo sapiens chromosome 2 
reference assembly 


genomic ( 




309 


440 


13% 


le- 


■80 


86% 


NW. 


.001838652. 


,1 


Homo sapiens chromosome 20 
alternate assembly (based 
SCAF_1103279188366) 


genomic 
on HuRef 


■contig; 


291 


601 


13% 


3e- 


■75 


88% 


NT_ 


_011387.8 




Homo sapiens chromosome 20 
reference assembly 


genomic 




291 


601 


13% 


3e- 


■75 


88% 


NW_ 


.001838084. 


2 


Homo sapiens chromosome 13 
alternate assembly (based 
SCAF_1103279188152) 


genomic 
on HuRef 




140 


182 


8% 


le- 


■29 


90% 


NT_ 


.009952.14 




Homo sapiens chromosome 13 
reference assembly 


genomic 




~~~~T40~~" 


~182~~ 




~Te^ 




90% 


NW. 


.001838403. 


,1 


Homo sapiens chromosome 17 
alternate assembly (based 
SCAF_1103279188371) 


genomic 
on HuRef 




129 


129 


7% 


2e- 


■26 


74% 


NT_ 


.010718.15 




Homo sapiens chromosome 17 
reference assembly 


genomic 


TS^Ttii;: — 


129 


~129 


~7% 


~"2e^ 




"74% 


NW. 


.001838454. 


,2 


Homo sapiens chromosome 17 
alternate assembly (based 
SCAF_11032 7918 8168) 


genomic 
on HuRef 




104 


246 


8% 


8e- 


■19 


90% 


NW. 


.001838884. 


2 


Homo sapiens chromosome 3 
alternate assembly (based 
SCAF_11032 7918 83 85) 


genomic ( 
on HuRef 




104 


366 


11% 


8e- 


■19 


92% 


NT_ 


.010641.15 




Homo sapiens chromosome 17 
reference assembly 


genomic 


contig. 


104 


163 


7% 




■19 


78% 


NT. 


.005612.15 




Homo sapiens chromosome 3 
reference assembly 






104 


448 


11% 


8e- 


■19 


92% 


NW. 


.001842405. 


1 


Homo sapiens chromosome X 
alternate assembly (based 
SCAF_1103279188404) 


genomic ^ 
on HuRef 




100 


141 


8% 


9e- 


■18 


87% 


NT. 


.011786.15 




Homo sapiens chromosome X 
reference assembly 


genomic i 




100 


223 


10% 


9e- 


■18 


92% 


NT. 


.027140.6 




Homo sapiens chromosome 13 
reference assembly 


genomic 


contig. 


96.9 


137 


8% 


le- 


■16 


92% 


nw_ 


.001838745. 


,1 


Homo sapiens chromosome 22 


genomic 


contig. 


95.1 


255 


9% 


4e- 


■16 


96% 
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alternate assembly (based on HuRef 
SCAF_1103279188372) 



NT_011520 .11 






95. 


. 1 


255 


9% 


4e-16 


96% 


NW_001839122. 


2 


Homo sapiens chromosome 8 genomic contig, 
SCAF_1103279188150) 


87. 


.8 


141 


6% 


6e-14 


-37% 


NT_077531.3 




Homo sapiens chromosome 8 genomic contig. 


87. 


,8 


141 


6% 


6e-14 


87% 


■nO'01838339". 


...... 


Homo sapiens chromosome 16 genomic contig, 
SCAF_1103279188181) 


86". 


.0 


139 


7% 


2e-13 


77% 


NT_03 7887.4 




Homo sapiens chromosome 16 genomic contig. 


86. 


.0 


139 


7% 


2e-13 


77% 


NW_001838533. 


...... 


Homo sapiens chromosome 1 genomic contig, 
alternate assembly (based on HuRef 
SCAF 1103279188157) 


77". 


;"o" 


"335 


6% 


le-10 


100% 


NW_001838085. 


.1 


Homo sapiens chromosome 13 genomic contig, 
SCAF_1103279188104) 


77. 


.0 


117 


4% 


le-10 


92% 


NT_004487.18 




Homo sapiens chromosome 1 genomic contig. 


™™ 


To" 


"376 


™6%™~ 


Te-To" 


T6o% 


NW_001839132. 


.1 


Homo sapiens chromosome 8 genomic contig, 
SCAF_1103279188282) 


66. 


.2 


285 


7% 


2e-07 


92% 


NT_008183.18 




Homo sapiens chromosome 8 genomic contig. 


66, 


.2 


328 


""7%"~ 


""Te-6T 




NW_001842422. 


1 


Homo sapiens chromosome Y genomic contig, 
alternate assembly (based on HuRef 
GCAF_110 32 7918 8414) 


64, 


. 4 


108 


5% 


7e-07 


84% 


NT 0118 96 . 9 




reference assembly 














NT_011333.5 




Homo sapiens chromosome 20 genomic contig. 


62. 


.6 


116 


6% 


2e-06 


79% 


NW_001838052. 


1 


Homo sapiens chromosome 12 genomic contig, 
alternate assembly (based on HuRef 
SCAF_1103279188408) 


57. 


.2 


140 


5% 


le-04 


96% 


NT_009714.16 




Homo sapiens chromosome 12 genomic contig. 


57. 


.2 


181 


5% 


le-04 


96% 


NW_001838671. 


,1 


Homo sapiens chromosome 20 genomic contig, 
alternate assembly (based on HuRef 
SCAF 1103279188118) 


53, 


.6 


53.6 


2% 


0.001 


79% 


NW_001838954. 


,2 


Homo sapiens chromosome 5 genomic contig, 
alternate assembly (based on HuRef 
SCAF_1103279188298) 


51, 




180 


5% 


0.004 


90% 


NT 011295.10 




reference assembly 














NT_023133.12 




Homo sapiens chromosome 5 genomic contig. 


51. 


.8 


180 


5% 


0.004 


90% 


NW_001838022. 


2 


Homo sapiens chromosome 11 genomic contig, 
alternate assembly (based on HuRef 
SCAF 1103279188392) 


50. 


.0 


131 


5% 


0.014 


92% 


NW_001838468. 


1 


Homo sapiens chromosome 18 genomic contig, 
alternate assembly (based on HuRef 
SCAF 1103279188038) 


50. 


.0 


50.0 


1% 


0.014 


87% 


NW_001838951. 


1 


Homo sapiens chromosome 5 genomic contig, 
alternate assembly (based on HuRef 
SCAF_11032 7918 83 95) 


50. 


.0 


94.5 


1% 


0.014 


87% 


NW_001838865. 


2 


Homo sapiens chromosome 2 genomic contig, 
alternate assembly (based on HuRef 
SCAF_11032 7918 8138) 


50. 


.0 


50.0 


1% 


0.014 


87% 
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NT 006576.15 




Homo sapiens chromosome 5 


















NT 009237.17 




Homo sapiens chromosome 11 
reference assembly 




50 . 


. 0 


131 


5% 


0 . 


.014 


92% 


NT 007819.16 




reference assembly 


















NT 010955.13 




reference assembly 





50" 


."6™ 


"soTo" 




■™ 


rsiT" 


"87% 


NT 010393.15 




reference assembly 


















NT 005403 . 16 




Homo sapiens chromosome 2 


















NT 005713 . 14 




reference assembly 




50 . 


. 0 


135 




„„. 




™8§% 


NW_001838051. 


2 


Homo sapiens chromosome 12 
SCAF_1103279188362) 


: genomic contig. 


46, 


.4 


251 


1% 


0. 


.18 


92% 


NW_001838005. 


2 


Homo sapiens chromosome IC 
SCAF_11032 79188397) 


1 genomic contig. 


46, 


.4 


130 


1% 


0. 


.18 


93% 


NW_001838988. 


2 


Homo sapiens chromosome 6 
SCAF_1103279187031) 


genomic contig. 


46, 


.4 


46.4 


1% 


0. 


.18 


93% 


NT 025741.14 




reference assembly 


















NT 113898.1 




reference assembly 




46 , 


. 4 


46 . 4 


1% 


0 , 


.18 


88% 


NT 007302.13 




Homo sapiens chromosome 6 
reference assembly 


genomic contig. 


46 , 


. 4 


46 . 4 


1% 


0 , 


.18 


88% 


NT 030059.12 




reference assembly 




46 , 


. 4 


171 


1% 


0 , 


.18 


93% 


NW_001838759. 


,1 


Homo sapiens chromosome 2 
alternate assembly (based 
SCAF_1103279188433) 


genomic contig. 


44, 


.6 


210 


4% 


0, 


.62 


96% 


NW_001838115. 


,2 


Homo sapiens chromosome 14 
SCAF_1103279188327) 


i genomic contig. 


44, 


.6 


85.5 


2% 


0, 


.62 


96% 


NW_001838461. 


,1 


Homo sapiens chromosome 18 

SCAF_1103279188344) 


1 genomic contig. 


44, 


.6 


44.6 


1% 


0, 


.62 


90% 


NW_001838758. 


,1 


Homo sapiens chromosome 2 
alternate assembly (based 
SCAF_110 32 7918 83 06) 


genomic contig. 


44, 


.6 


44.6 


1% 


0, 


.62 


83% 


NW_001838987. 


,1 


Homo sapiens chromosome 6 
SCAF_110 32 7918 82 74B) 


genomic contig. 


44, 


.6 


85.5 


1% 


0, 


.62 


92% 


NW_001838928. 


,2 


Homo sapiens chromosome 5 
SCAF_1103279188369) 


genomic contig. 


44, 


.6 


44.6 


0% 


0, 


.62 


96% 


NW_001838915. 


1 


Homo sapiens chromosome 4 
SCAF_11032 7918 83 99) 


genomic contig. 


44, 


.6 


249 


1% 


0, 


.62 


96% 


NW_001838878. 


1 


Homo sapiens chromosome 3 
SCAF_11032 79188187) 


genomic contig. 


44, 


.6 


85.5 


1% 


0, 


.62 


90% 


NW_001838953. 


2 


Homo sapiens chromosome 5 
alternate assembly (based 
SCAF_1103279188146) 


genomic contig. 


44, 


.6 


44.6 


1% 


0, 


.62 


88% 






Homo sapiens chromosome 14 
reference assembly 


I genomic contig. 


44". 


."e" 


■■249 


■■■■2% 


0". 


"62" 


■92% 


NT_016354.18 








44. 


.6 


536 


1% 


0. 


.62 


96% 
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Homo sapiens chromosome 4 genomic contig. 









reference 


assembly 




















008470 . 






assembly 


9 genomic cont.g. 


44. 
















02~2T59^ 




reference 


assembly 




















.032977 . 


. 8 


Homo sapi€ 
reference 


ins chromosome 
assembly 


1 genomic contig. 


44. 


. 6 


374 


3% 


0 . 


.62 


96% 


NT_ 


.010859. 


.14 


Homo sap is 
reference 


ms chromosome 
assembly 


18 genomic contig. 


44. 


.6 


44.6 


1% 


0. 


.62 


90% 
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>ref |NM_003107.2| HUIkSlMl Homo sapiens SRY (sex determining region Y)-box 4 (S0X4), mRNA 

Length=4912 



Score = 
Identit 
Strand= 



Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 
Query 
Sbjct 



5012 bits (5558), Expect = 0.0 
es = 2789/2793 (99%), Gaps = 2/2793 (0%) 

Plus/Plus 

5 CCAGCATTCGAGAAACTCCTCTCTACTTTAGCACGGTCTCCAGACTCAGCCGAGAGACAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

4 49 CCAGCATTCGAGAAACTCCTCTCTACTTTAGCACGGTCTCCAGACTCAGCCGAGAGACAG 

65 CAAACTG TCTCCAGCCTGGGAA 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

5 09 CAAACTGCAGCGCGGTGAGAGAGCGAGAGAGAGGGAGAGAGAGACTCTCCAGCCTGGGAA 

125 CTATAACTCCTCTGCGAGAGGCGGAGAACTCCTTCCCCAAATCTTTTGGGGACTTTTCTC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
569 CTATAACTCCTCTGCGAGAGGCGGAGAACTCCTTCCCCAAATCTTTTGGGGACTTTTCTC 

185 TCTTTACCCACCTCCGCCCCTGCGAGGAGTTGAGGGGCCAGTTCGGCCGCCGCGCGCGTC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
629 TCTTTACCCACCTCCGCCCCTGCGAGGAGTTGAGGGGCCAGTTCGGCCGCCGCGCGCGTC 

2 45 TTCCCGTTCGGCGTGTGCTTGGCCCGGGGAACCGGGAGGGCCCGGCGATCGCGCGGCGGC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

6 89 TTCCCGTTCGGCGTGTGCTTGGCCCGGGGAACCGGGAGGGCCCGGCGATCGCGCGGCGGC 

3 05 CGCCGCGAGGGTGTGAGCGCGCGTGGGCGCCCGCCGAGCCGAGGCCATGGTGCAGCAAAC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

7 49 CGCCGCGAGGGTGTGAGCGCGCGTGGGCGCCCGCCGAGCCGAGGCCATGGTGCAGCAAAC 

365 CAACAATGCCGAGAACACGGAAGCGCTGCTGGCCGGCGAGAGCTCGGACTCGGGCGCCGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

8 09 CAACAATGCCGAGAACACGGAAGCGCTGCTGGCCGGCGAGAGCTCGGACTCGGGCGCCGG 

42 5 CCTCGAGCTGGGAATCGCCTCCTCCCCCACGCCCGGCTCCACCGCCTCCACGGGCGGCAA 
I I I I I I I I I I I I i 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
869 CCTCGAGCTGGGAATCGCCTCCTCCCCCACGCCCGGCTCCACCGCCTCCACGGGCGGCAA 

485 GGCCGACGACCCGAGCTGGTGC7VAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
929 GGCCGACGACCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGC 

545 CTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCA 
I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

9 89 CTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCA 

6 05 CAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

10 49 CAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAA 

665 GATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGA 
I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I [ I I I I I I I I I I I I I I I I I I 
1109 GATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGA 

72 5 CTACAAGTACCGGCCCAGGAAGAAGGTGAAGTCCGGCAACGCCAACTCCAGCTCCTCGGC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
1169 CTACAAGTACCGGCCCAGGAAGAAGGTGAAGTCCGGCAACGCCAACTCCAGCTCCTCGGC 

785 CGCCGCCTCCTCCAAGCCGGGGGAG - - . . 

I I I I I I I I I I I I I I I I I I I I I I I I I ] i I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
12 2 9 CGCCGCCTCCTCCAAGCCGGGGGAGAAGGGAGACAAGGTCGGTGGCAGTGGCGGGGGCGG 

845 - - . . ^ - . . . . - . - . . ^ - 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

12 89 CCATGGGGGCGGCGGCGGCGGCGGGAGCAGCAACGCGGGGGGAGGAGGCGGCGGTGCGAG 

905 ^ . o ^CCAACTCCAAACCGGCGCAGAAAAAGAGCTGCGGCTCCAAAGTGGCGGG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

13 49 TGGCGGCGGCGCCAACTCCAAACCGGCGCAGAAAAAGAGCTGCGGCTCCAAAGTGGCGGG 



508 
124 
568 
184 
628 
244 
688 
304 
748 
364 
808 
424 
868 
484 



604 
1048 
664 
1108 
724 
1168 
784 
1228 
844 
1288 
904 
1348 
964 
1408 
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965 


Sbjct 


1409 




1025 


Sbjct 


1469 




1085 


Sbjct 


1529 




1145 


Sbjct 


1589 




1205 


Sbjct 


1649 




1265 


Sbjct 


1709 




1325 


Sbjct 


1769 




1385 


Sbjct 


1829 




1445 


Sbjct 


1889 




1505 


Sbjct 


1949 




1565 


Sbjct 


2009 




1625 


Sbjct 


2069 




1685 


Sbjct 


2129 




1745 


Sbjct 


2189 




1805 


Sbjct 


2249 




1865 


Sbjct 


2309 




1924 


Sbjct 


2369 




1984 


Sbjct 


2429 




2044 


Sbjct 


2489 




2103 






Query 


2163 



CGGCGCGGGCGGTGGGGTTAGCAAACCGCACGCCAAGCTCATCCTGGCA. . ^ 1024 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i i I I I 

CGGCGCGGGCGGTGGGGTTAGCAAACCGCACGCCAAGCTCATCCTGGCAGGCGGCGGCGG 1468 

cqacaaGAAAGCAGCGGCTaccaccqccacctcctfccaccQCcaa5:Caqqcaaac;qcc;3c 1084 
I I I I I i I I I I I I I I I I I I M I I I I I T I I I I I I I I I I I I I I I I I f I I I I fl I I I r I' 1' I I I I 

CGGCGGGAAAGCAGCGGCTGCCGCCGCCGCCTCCTTCGCCGCCGAACAGGCGGGGGCCGC 1528 

cc?ccc;;Qc;:acccctqaacaccqccaccaACCACCACTCGCTGTACAAGGCGCGGACTCC 1144 
I I I I I I I I I [ I I I I I 11 [ I I I I I I I [ I I j I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGCCCTGCTGCCCCTGGGCGCCGCCGCCGACCACCACTCGCTGTACAAGGCGCGGACTCC 1588 

CAGCGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGCGCTCGCGGCCCCGGG 1204 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CAGCGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGCGCTCGCGGCCCCGGG 1648 

CAAGCACCTGGCGGAGAAGAAGGTGAAGCGCGTCTACCTGTTCGGCGGCCTGGGCACGTC 1264 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CAAGCACCTGGCGGAGAAGAAGGTGAAGCGCGTCTACCTGTTCGGCGGCCTGGGCACGTC 1708 

GTCGTCGCCCGTGGGCGGCGTGGGCGCGGGAGCCGACCCCAGCGACCCCCTGGGCCTGTA 1324 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GTCGTCGCCCGTGGGCGGCGTGGGCGCGGGAGCCGACCCCAGCGACCCCCTGGGCCTGTA 1768 

CGAGGAGGAGGGCGCGGGCTGCTcc?ccca5cqcc?ccc5qcc".aaqc-qcT-:c:ac.;KK^acjcac 1384 
I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I '[ I I I I I I I M I I I I I I I I I I I M I' M I 

CGAGGAGGAGGGCGCGGGCTGCTCGCCCGACGCGCCCAGCCTGAGCGGCCGCAGCAGCGC 1828 

I iTm'TTi I i'T I'TtTTiM ittTTTi TitTTTTi1tTTTTTi1'TTTT^^ 1444 

CGCCTCGTCCCCCGCCGCCGGCCGCTCGCCCGCCGACCACCGCGGCTACGCCAGCCTGCG 1888 

cc:CCCjc;ctcc:CCCcyccccq;:.cc;aaccK;cK:^ccTCGCACGCGtccfccctc«qcctcatccca 1504 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I 

CGCCGCCTCGCCCGCCCCGTCCAGCGCGCCCTCGCACGCGTCCTCCTCGGCCTCGTCCCA 1948 

ctcctcctcttcctcctcctcaaqctcctcatcctccaacqacaaattcqaaaacGACCT 1564 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I [ I 1 I I I I I I I I I I I I I I 

CTCCTCCTCTTCCTCCTCCTCGGGCTCCTCGTCCTCCGACGACGAGTTCGAAGACGACCT 2008 

GCTCGACCTGAACCCCAGCTCAAACTTTGAGAGCATGTCCCTGGGCAGCTTCAGTTCGTC 1624 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCTCGACCTGAACCCCAGCTCAAACTTTGAGAGCATGTCCCTGGGCAGCTTCAGTTCGTC 2068 

GTCGGCGCTCGACCGGGACCTGGATTTTAACTTCGAGCCCGGCTCCGGCTCGCACTTCGA 1684 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GTCGGCGCTCGACCGGGACCTGGATTTTAACTTCGAGCCCGGCTCCGGCTCGCACTTCGA 2128 

GTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGAGACTGGCTCGAGTC 1744 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGAGACTGGCTCGAGTC 2188 

CAGCATCTCCAACCTGGTTTTCACCTACTGAAGGGCGCGCAGGCAGGGAGAAGGGCCqoo- 1804 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CAGCATCTCCAACCTGGTTTTCACCTACTGAAGGGCGCGCAGGCAGGGAGAAGGGCCGGG 2248 

«o-aaaTAGGAGAGGAqa.^ . . -aa v ^ . : ^aaaa ^ . ^q.v . ^ aoa ^ -a ^ i v. 1864 

rf iH' 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r I I I I II I I I I I I I I I' I I I I I 1 I 1 I 

GGGGGT AGGAGAGGAGAAAAAAAAAGTGAAAAAAAGAAACGAAAAGGAC AGACGAAGAGT 2308 

ttaaaqaaaaa.;=qqqaaaa.u«qaaaaaoaaaataaqcaaQaCTCG-TTCGCCCGCGTTCT 1923 
I I I I I I I i I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I 

TTAAAGAGAAAAGGGAAAAAAGAAAGAAAAAGTAAGCAGGGCTGGCTTCGCCCGCGTTCT 2368 

CGTCGTCGGATCAAGGAGCGCGGCGGCGTTTTGGACCCGCGCTCCCATCCCCCACCTTCC 1983 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGTCGTCGGATCAAGGAGCGCGGCGGCGTTTTGGACCCGCGCTCCCATCCCCCACCTTCC 2428 

CGGGCCGGGGACCCACTCTGCCCAGCCGGAGGGACGCGGAGGAGGAAGAGGGTAGACAGG 2043 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGGGCCGGGGACCCACTCTGCCCAGCCGGAGGGACGCGGAGGAGGAAGAGGGTAGACAGG 2488 

GGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGC-aa.:U:aaaaaa.:iGCGACTT 2102 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGCAAAAAAAAAAAAGCGACTT 2548 

CGAGTTTGCTCCCCTTTGCTTGAAGAGACCCCCTCCCCCTTCCAACGAGCTTCCGGACTT 2162 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CGAGTTTGCTCCCCTTTGCTTGAAGAGACCCCCTCCCCCTTCCAACGAGCTTCCGGACTT 2608 

2222 
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Sb3Ct 




Query 


2223 


Sbjct 




Query 


2283 


SbDCt 




Query 


2343 


SbDCt 




Query 


2403 


SbDCt 




Query 


2463 


SbDCt 




Query 


2523 


SbDCt 




Query 


2583 


SbDCt 




Query 


2643 


Sb3Ct 




Query 


2703 


SbDCt 


3149 


Query 


2763 


Sbjct 


3209 
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GTCTGCACCCCCAGCAAGAAGGCGAGTTAGTTTTCTAGAGACTTGAAGGAGTCTCCCCCT 2668 

TCCTGCATCACCACCTTGGTTTTGTTTTATTTTGCTTCTTGGTCAAGAAAGGAGGGGAGA 2282 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCTGCATCACCACCTTGGTTTTGTTTTATTTTGCTTCTTGGTCAAGAAAGGAGGGGAGA 2728 

ACCCAGCGCACCCCTccccccctfctttttAAACGCGTGATGAAGACAGAAGGCTCCGGGG 2342 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ACCCAGCGCACCCCTCCCCCCCTTTTTTTAAACGCGTGATGAAGACAGAAGGCTCCGGGG 2788 

TGACGAATTTGGCCGATGGCAGATGTTTTGGGGGAACGCCGGGACTGAGAGACTCCACGC 2402 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGACGAATTTGGCCGATGGCAGATGTTTTGGGGGAACGCCGGGACTGAGAGACTCCACGC 2848 

AGGCGAATTCCCGTTTGGGGCCtttttbtCCTCCCTCTTTTCCCCTTGCCCCCTCTGCAG 2462 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGGCGAATTCCCGTTTGGGGCTTTTTTTTCCTCCCTCTTTTCCCCTTGCCCCCTCTGCAG 2908 

CCGGAGGAGGAGATGTTGAGGGGAGGAGGCCAGCCAGTGTGACCGGCGCTAGGAAATGAC 2522 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCGGAGGAGGAGATGTTGAGGGGAGGAGGCCAGCCAGTGTGACCGGCGCTAGGAAATGAC 2968 

CCGAGAACCCCGTTGGAAGCGCAGCAGCGGGAGCTAGGGGCGGGGGCGGAGGAGGACACG 2582 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCGAGAACCCCGTTGGAAGCGCAGCAGCGGGAGCTAGGGGCGGGGGCGGAGGAGGACACG 3028 

AACTGGAAGGGGGTTCACGGTCAAACTGAAATGGATTTGCACGTTGGGGAGCTGGCGGCG 2642 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AACTGGAAGGGGGTTCACGGTCAAACTGAAATGGATTTGCACGTTGGGGAGCTGGCGGCG 3088 

GCGGCTGCTGGGCCTCCGCCTTCTTTTCTACGTGAAATCAGTGAGGTGAGACTTCCCAGA 2702 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCGGCTGCTGGGCCTCCGCCTTCTTTTCTACGTGAAATCAGTGAGGTGAGACTTCCCAGA 3148 

CCCCGGAGGCGTGGAGGAGAGGAGACTGTTTGATGTGGTACAGGGGCAGTCAGTGGAGGG 2762 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCCCGGAGGCGTGGAGGAGAGGAGACTGTTTGATGTGGTACAGGGGCAGTCAGTGGAGGG 3208 

CGAGTGGTTTCGGaaaaaa-i-saaaaaa-i-saaaG 2795 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGAGTGGTTTCGGAAAAAAAAAAAGAAAAAAAG 3241 



>ref |NM_003108.3| Homo sapiens SRY (sex determining region Y)-box 11 (SOXll), mRNA 

Length=8737 

GENE ID: 6664 SOXll I SRY (sex determining region Y)-box 11 [Homo sapiens] 
(Over 10 PubMed links) 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 



Score = 309 bits (342), Expect = le-80 
Identities = 224/259 (86%), Gaps = 0/259 (0%) 
Strand=Plus/Plus 

CGACCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGCCTTCAT 550 
I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I II I I I I I I I I I I I I I 
CGACCCAGACTGGTGCAAGACGGCGTCGGGCCACATCAAGCGGCCGATGAACGCGTTCAT 225 

GGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCACAACGC 610 
III I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
GGTATGGTCCAAGATCGAACGCAGGAAGATCATGGAGCAGTCTCCGGACATGCACAACGC 285 

CGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAAGATCCC 670 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
CGAGATCTCCAAGAGGCTGGGCAAGCGCTGGAAAATGCTGAAGGACAGCGAGAAGATCCC 345 

TTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAA 730 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTTCATCCGGGAGGCGGAGCGGCTGCGGCTCAAGCACATGGCCGACTACCCCGACTACAA 405 

GTACCGGCCCAGGAAGAAG 749 
I I I I I I I I I I I I I I III 

GTACCGGCCCCGGAAAAAG 424 



Query 


491 


Sbjct 


166 


Query 


551 


Sbjct 


226 


Query 


611 


Sbjct 


286 


Query 


671 


Sbjct 


346 


Query 


731 


Sbjct 


406 



Score = 131 bits (144), Expect = 5e-27 
Identities = 96/112 (85%), Gaps = 0/112 (0%) 
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Strand=Plus/Plus 

Query 1671 GGCTCGCACTTCGAGTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGA 1730 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I 
Sbjct 1277 GGCTCCCACTTCGAGTTCCCCGACTACTGCACGCCGGAGCTGAGCGAGATGATCGCGGGG 1336 

Query 1731 GACTGGCTCGAGTCCAGCATCTCCAACCTGGTTTTCACCTACTGAAGGGCGC 1782 

I I I I I I I I III I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Sbjct 1337 GACTGGCTGGAGGCGAACTTCTCCGACCTGGTGTTCACATATTGAAAGGCGC 1388 



>ref |NM_006943.2| Homo sapiens SRY (sex determining region Y)-box 12 (S0X12), mRNA 

Length=4645 

GENE ID: 6666 S0X12 I SRY (sex determining region Y)-box 12 [Homo sapiens] 
(10 or fewer PubMed links) 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 



Score = 291 bits (322), Expect = 3e-75 
Identities = 213/246 (86%), Gaps = 1/246 (0%) 
Strand=Plus/Plus 

CCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGCCTTCATGGT 553 
I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I 
CCCG-GCTGGTGCAAGACCCCGAGCGGCCACATCAAGAGGCCGATGAACGCATTCATGGT 476 

GTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGC ACAACGCCGA 613 
I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTGGTCGCAGCACGAACGGCGGAAGATCATGGACCAGTGGCCCGACATGC ACAACGCCGA 536 

GATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAAGATCCCTTT 673 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GATCTCCAAGCGCCTGGGCCGCCGCTGGCAGCTGCTGCAGGACTCGGAGAAGATCCCGTT 596 

CATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTA 733 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
CGTGCGGGAGGCGGAGCGGCTGCGGCTCAAGCACATGGCGGATTACCCGGACTACAAGTA 656 

CCGGCC 739 
I I I I I I 

CCGGCC 662 



Query 


494 


Sbjct 


418 


Query 


554 


Sbjct 


477 


Query 


614 


Sbjct 


537 


Query 


674 


Sbjct 


597 


Query 


734 


Sbjct 


657 



Score = 131 bits (144), Expect = 5e-27 
Identities = 90/102 (88%), Gaps = 0/102 (0%) 
Strand=Plus/Plus 

Query 1674 TCGCACTTCGAGTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGAGAC 1733 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I III 
Sbjct 1177 TCGCACTTCGAGTTCCCGGACTACTGCACCCCCGAGGTTACCGAGATGATCGCGGGGGAC 1236 

Query 1734 TGGCTCGAGTCCAGCATCTCCAACCTGGTTTTCACCTACTGA 1775 

I I I I I III II II II I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1237 TGGCGCCCGTCTAGCATCGCAGACCTGGTTTTCACCTACTGA 1278 



>ref I NM_006941 . 3 I fil^^El Homo sapiens SRY (sex determining region Y)-box 10 (SOXIO), mRNA 

Length=2882 

GENE ID: 6663 SOXIO I SRY (sex determining region Y)-box 10 [Homo sapiens] 
(Over 10 PubMed linlts) 



Score = 167 bits (184), Expect = 7e-38 
Identities = 173/224 (77%), Gaps = 2/224 (0%) 
Strand=Plus/Plus 



Query 


521 


GCACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT 


580 


Sbjct 


584 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II II 1 1 III 1 III 1 III 1 
GCACGTCAAGCGGCCCATGAACGCCTTCATGGTGTGGGCTCAGGCAGCGCGCAGGAAGCT 


643 


Query 


581 


CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTG 


640 


Sbjct 


644 


1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 INI 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGCGGACCAGTACCCGCACCTGCACAACGCTGAGCTCAGCAAGACGCTGGGCAAGCTCTG 


703 


Query 


641 


GAAGCTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCC 
II 1 1 1 1 1 1 II II II 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 


699 
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NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein Page 12 of 75 

Sbjct 704 GAGGCTGCTGAACGAAAGTGACAAGCGCCCCTTCA-TCGAGGAGGCTGAGCGGCTCCGTA 762 

Query 700 TCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCCCAGG 743 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 763 TGCAGCACAAGAAAGACCACCCGGACTACAAGTACCAGCCCAGG 806 

>ref I NM_000346 . 2 I B^'^jl Homo sapiens SRY (sex determining region Y)-box 9 (campomelic 
dysplasia, autosomal sex-reversal) (S0X9), mRNA 

Length=3935 

GENE ID: 6662 S0X9 I SRY (sex determining region Y)-box 9 (campomelic 
dysplasia, autosomal sex-reversal) [Homo sapiens] (Over 10 PubMed links) 



Score = 156 bits (172), Expect = le-34 
Identities = 166/219 (75%), Gaps = 0/219 (0%) 

Strand=Plus/Plus 



CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTG 

I I I I I I I I II II I I I I II I I I I II II I I I I I I I I I I I I I I I I I I I 
CGCGGACCAGTACCCGCACTTGCACAACGCCGAGCTCAGCAAGACGCTGGGCAAGCTCTG 

GAAGCTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCT 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II II II II I I 
GAGACTTCTGAACGAGAGCGAGAAGCGGCCCTTCGTGGAGGAGGCGGAGCGGCTGCGCGT 



Query 


521 


Sbjct 


681 


Query 


581 


Sbjct 


741 


Query 


641 


Sbjct 


801 


Query 


701 


Sbjct 


861 



CAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 
I I I I I I I III I I I I II I I I I I I I I I I III 
GCAGCACAAGAAGGACCACCCGGATTACAAGTACCAGCC 



>ref I NM_007084 . 2 I Ci..i^2.uJ Homo sapiens SRY (sex determining region Y)-box 21 (S0X21), mRNA 
Length=2537 

GENE ID: 11166 30X21 I SRY (sex determining region Y)-box 21 [Homo sapiens] 
(10 or fewer PubMed links) 



Score = 140 bits (154), Expect = le-29 
Identities = 164/219 (74%), Gaps = 2/219 (0%) 
Strand=Plus/Plus 

CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 
III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CACGTCAAGCGGCCCATGAACGCCTTCATGGTGTGGTCGCGGGCTCAGCGGCGCAAGATG 164 

ATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGG 641 



AAGCTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCT 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I 

AAACTGCTCACAGAGTCGGAGAAGCGGCCGTTCA-TCGACGAGGCCAAGCGTCTACGCGC 



Query 


522 


Sbjct 


105 


Query 


582 


Sbjct 


165 


Query 


642 


Sbjct 


225 


Query 


701 


Sbjct 


284 



CAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 
II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
CATGCACATGAAGGAGCACCCCGACTACAAGTACCGGCC 



>ref I NM_006942 . 1 I llI^^E Homo sapiens SRY (sex determining region Y)-box 15 (S0X15), mRNA 

Length=1394 

GENE ID: 6665 S0X15 I SRY (sex determining region Y)-box 15 [Homo sapiens] 
(Over 10 PubMed links) 



Score = 129 bits (142), Expect = 2e-26 
Identities = 158/213 (74%), Gaps = 2/213 (0%) 
Strand=Plus/Plus 

Query 528 AAGCGACCCATGAACGCCTTCATGGTGTGGT-CGCAGATCGAGCGGCGCAAGATCATGGA 586 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I III MM I I 
Sbjct 642 AAGCGGCCGATGAACGCGTTCATGGTGTGGAGCTCCGCTC-AGCGCCGCCAGATGGCGCA 700 

Query 587 GCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCT 646 

II II III I II II II II I II II II II II II II II II II II I II II II II 



)://blast.ncbi.nlm.nih.gov/Blast.( 



7/16/08 



Sb3Ct 




Query 




Sbjct 




Query 




Sbjct 


821 
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GCAGAACCCCAAGATGCACAACTCCGAGATCTCCAAGCGCCTGGGCGCGCAGTGGAAGCT 760 

GCTCAAAGACAGCGACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCA 706 
III III I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

GCTGGACGAGGACGAGAAGCGGCCCTTCGTGGAGGAGGCCAAGCGGCTCCGCGCCCGACA 820 

CATGGCTGACTACCCCGACTACAAGTACCGGCC 739 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCTGCGCGACTACCCCGACTACAAGTACCGGCC 853 

>ref |NM_014587.2| Homo sapiens SRY (sex determining region Y)-box 8 (S0X8), mRNA 

Length=3069 

GENE ID: 30812 S0X8 I SRY (sex determining region Y)-box 8 [Homo sapiens] 
(Over 10 PubMed links) 

Score = 127 bits (140), Expect = 7e-26 
Identities = 179/252 (71%), Gaps = 6/252 (2%) 
Strand=Plus/Plus 

AAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAG 587 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I II 
AAGCGGCCCATGAACGCATTCATGGTGTGGGCGCAGGCGGCGCGCCGCAAGCTGGCCGAC 481 

CAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTG 647 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II 
CAGTACCCGCACCTGCACAACGCCGAGCTCAGCAAGACGCTGGGCAAGCTGTGGCGCTTG 541 

CTCAAAGACAGCGACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCAC 707 
III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTGAGCGAGAGCGAGAAGCGGCCCTTCGTGGAGGAGGCAGAGCGCCTTCGCGTGCAGCAC 601 

ATGGCTGACTACCCCGACTACAAGTAC CGGCCCAGGAAGAAGGTGAAGTCCGGC 761 

I I I I I I I I I I I I I II I I I I I 

661 

AACGCCAACTCC 773 

II I I I I I I 
CACAGCGACTCC 673 



Query 


528 


Sbjct 


422 


Query 


588 


Sbjct 


482 


Query 


648 


Sbjct 


542 


Query 


708 


Sbjct 


602 


Query 


762 


Sbjct 


662 



>ref |NM_031439.2| 111^^011 Homo sapiens SRY (sex determining region Y)-box 7 (S0X7), mRNA 
Length=3219 

GENE ID: 83595 S0X7 I SRY (sex determining region Y)-box 7 [Homo sapiens] 
(10 or fewer PubMed links) 



Score = 113 bits (124), Expect = le-21 
Identities = 159/223 (71%), Gaps = 3/223 (1%) 

Strand=Plus/Plus 

GCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCA 5 89 
I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I II 

GCGGCCCATGAACGCCTTCATGGTTTGGGCCAAGGACGAGAGGAAACGGCTGGCAGTGCA 2 76 

GTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCT 6 49 
I II I II II II II II II II II II I I I I I I I I I I I I I I I I I I III 

GAACCCGGACCTGCACAACGCCGAGCTCAGCAAGATGCTGGGAAAGTCGTGGAAGGCGCT 336 

CAAAGACAGCGACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACAT 709 

I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I 

GACGCTGTCCCAGAAGAGGCCGTACGTGGACGAGGCGGAGCGGCTGCGCCTGCAGCACAT 396 

GGCTGACTACCCCGACTACAAGTACCGGCC CAGGAAGAAG 749 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 I I I I I 
GCAGGACTACCCCAACTACAAGTACCGGCCGCGCAGGAAGAAG 439 



Query 


530 


Sbjct 


217 


Query 


590 


Sbjct 


277 


Query 


650 


Sbjct 


337 


Query 


710 


Sbjct 


397 



>ref I NM_018419 . 2 I Homo sapiens SRY (sex determining region Y)-box 18 (S0X18), mRNA 

Length=1718 

GENE ID: 54345 S0X18 I SRY (sex determining region Y)-box 18 [Homo sapiens] 
(Over 10 PubMed links) 



Score = 107 bits (118), Expect = 6e-20 
Identities = 160/223 (71%), Gaps = 8/223 (3%) 
Strand=Plus/Plus 



)://blast.ncbi.nlm.nih.gov/Blast.( 



7/16/08 





521 


Sbjct 


359 




581 


Sbjct 


416 




638 


Sbjct 


476 






Sbjct 


535 



NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein Page 14 of 75 

GCACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT 580 

II I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
GCGCATCCGGCGGCCCATGAACGCCTTCATGGTGTGGGCAAAGGACGAGCG CAAGCG 415 

CATGG AGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACG 637 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCTGGCTCAGCAGAACCCGGACCTGCACAACGCGGTGCTCAGCAAGATGCTGGGCAAAGC 475 

CTGGAAGCTGCTCAAAGACAGCG-ACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGC 696 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GTGGAAGGAGCTGAACG-CGGCGGAGAAGCGGCCCTTCGTGGAGGAAGCCGAACGGCTGC 534 

GCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCGTGCAGCACTTGCGCGACCACCCCAACTACAAGTACCGGCC 577 

>ref |NM_004189.2 I Homo sapiens SRY (sex determining region Y)-box 14 (S0X14), mRNA 

Length=818 



Score = 104 bits (114), Expect = 8e-19 
Identities = 161/224 (71%), Gaps = 12/224 (5%) 
Strand=Plus/Plus 

CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II 
CACATCAAGCGGCCCATGAACGCCTTCATGGTATGGTCCCGGGGCCAGCGGCGCAAGATG 126 

ATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGG CAAACGC 63 8 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
GCCCAGGAAAACCCCAAGATGCACAACTCGGAGATCAGCAAACGCCTAGGTGCCGAA 183 

TGGAAGCTGC— TCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTG 695 
I I I I I I II 1 II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGGAAGCTTCTGTCCGAGGCA — GAGAAGCGGCCATACA-TCGATGAAGCCAAGCGGCTA 240 

CGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 
III I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
CGCGCCCAGCACATGAAGGAGCACCCTGACTACAAGTACCGACC 284 

>ref |NM_022454.3 1 Homo sapiens SRY (sex determining region Y)-box 17 (S0X17), mRNA 

Length=2350 



Score = 100 bits (110), Expect = 9e-18 
Identities = 153/214 (71%), Gaps = 8/214 (3%) 
Strand=Plus/Plus 

GCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCA 589 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

GCGGCCGATGAACGCTTTCATGGTGTGGGCTAAGGACGAGCG CAAGCGGCTGGCGCA 467 

GTCG CCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCT 646 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCAGAATCCAGACCTGCACAACGCCGAGTTGAGCAAGATGCTGGGCAAGTCGTGGAAGGC 527 

GCTCAAAGACAGCG-ACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGC 705 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 

GCTGAC-GCTGGCGGAGAAGCGGCCCTTCGTGGAGGAGGCAGAGCGGCTGCGCGTGCAGC 586 



Query 


522 


Sbjct 


67 


Query 


582 


Sbjct 


127 


Query 


639 


Sbjct 


184 


Query 


696 


Sbjct 


241 



Query 


530 


Sbjct 


411 


Query 


590 


Sbjct 


468 


Query 


647 


Sbjct 


528 


Query 


706 


Sbjct 


587 



ACATGGCTGACTACCCCGACTACAAGTACCGGCC 
I I I I I III I I I I I I I I I I I I I I I I I I I I I 
ACATGCAGGACCACCCCAACTACAAGTACCGGCC 



>ref |NM_005634.2 1 Homo sapiens SRY (sex determining region Y)-box 3 (S0X3), mRNA 

Length=2074 



)://blast.ncbi.nlm.nih.gov/Blast.( 



7/16/08 



NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein 

Score = 100 bits (110), Expect = 9e-18 
Identities = 154/216 (71%), Gaps = 8/216 (3%) 
Strand=Plus/Plus 

AAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT CATG 584 

II II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I II 

AAACGGCCCATGAACGCCTTCATGGTATGGTCCCGCGGGCAGCGGCGCAAAATGGCCCTG 477 

GAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCTyvGCGGCTGGGCAAACGCTGGAAG 644 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

GAGAAC CCCAAGATGCACAATTCTGAGATCAGCAAGCGCTTGGGCGCCGACTGGAAA 534 

CTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCTCAA 703 
I I I I I I III I I I I I I I I I I I I I I II II II I II II II I I I I 
CTGCTGACCGACGCCGAGAAGCGACCATTCAT-CGACGAGGCCAAGCGACTTCGCGCCGT 593 

GCACATGGCT G AC T AC C C C G AC T AC AAG TACCGGCC 739 
I I I I I I I II II II I I I I I I I I I I II II II 
GCACATGAAGGAGTATCCGGACTACAAGTACCGACC 629 



Page 15 of 75 



Query 


528 


Sbjct 


418 


Query 


585 


Sbjct 


478 


Query 


645 


Sbjct 


535 


Query 


704 


Sbjct 


594 



>ref I NM_005986 . 2 I llE.^^^^ Homo sapiens SRY (sex determining region Y)-box 1 (SOXl), mRNA 
Length=4108 

GENE ID: 6656 SOXl I SRY (sex determining region Y)-box 1 [Homo sapiens] 

(10 or fewer PubMed links) 



Score = 96.9 bits (106), Expect = le-16 
Identities = 152/215 (70%), Gaps = 2/215 (0%) 
Strand=Plus/Plus 

TCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGG 585 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TCAAACGGCCCATGAACGCCTTCATGGTGTGGTCCCGCGGGCAGCGGCGCAAGATGGCCC 271 

AGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGC 645 



TGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCTCAAG 
I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I 

TCATGTCCGAGGCCGAGAAGCGGCCGTTCA-TCGACGAGGCCAAGCGGCTGCGCGCGCTG 



Query 


526 


Sbjct 


212 


Query 


586 


Sbjct 


272 


Query 


646 


Sbjct 


332 


Query 


705 


Sbjct 


391 



1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



>ref I NM_003106 . 2 I HE^SSl Homo sapiens SRY (sex determining region Y)-box 2 (S0X2), mRNA 
Length=2518 

GENE ID: 6657 S0X2 I SRY (sex determining region Y)-box 2 [Homo sapiens] 
(Over 10 PubMed links) 



Score 
Ident 
Strani 



= 9] 
ties 
d=Pli 



Query 


526 


Sbjct 


549 


Query 


586 


Sbjct 


609 


Query 


646 


Sbjct 


669 


Query 


705 


Sbjct 


728 



.5 bits (100), Expect = 5e-15 
= 153/219 (69%), Gaps = 2/219 (0%) 
.s/Plus 

TCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGG 5 85 

TCAAGCGGCCCATGAATGCCTTCATGGTGTGGTCCCGCGGGCAGCGGCGCAAGATGGCCC 6 0 8 

AGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGC 6 45 

AGGAGAACCCCAAGATGCACAACTCGGAGATCAGCAAGCGCCTGGGCGCCGAGTGGAAAC 66 8 

TGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCTCAAG 7 0 4 
I I III I I I I I I I I I I I I II I I I I I I I I I I I II I I I I 

TTTTGTCGGAGACGGAGAAGCGGCCGTTCA-TCGACGAGGCTAAGCGGCTGCGAGCGCTG 727 



CACATGGCTGACTACCCCGACTACAAGTACCGGCCCAGG 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
CACATGAAGGAGCACCCGGATTATAAATACCGGCCCCGG 



743 
766 



>ref |NM_005686.2 1 Homo sapiens SRY (sex determining region Y)-box 13 (S0X13), mRNA 

Length=4088 

GENE ID: 9580 S0X13 I SRY (sex determining region Y)-box 13 [Homo sapiens] 



)://blast.ncbi.nlm.nih.gov/Blast.( 



7/16/08 



NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein 

(Over 10 PubMed links) 
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Score =78.8 bits (86), Expect = 3e-ll 
Identities = 153/218 (70%), Gaps = 10/218 (4%) 
Strand=Plus/Plus 

Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Sbjct 1862 CACATCAAGAGGCCCATGAACGCCTTCATGGTGTGGGCCAAGGATGAGCGGAGGAAGATC 1921 

Query 582 ATG-GAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTG 640 

II III I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I 

Sbjct 1922 CTGCAAGCCTTC-CCAGACATGCACAACTCCAGCATCAGCAAGATCCTTGGATCTCGCTG 1980 

Query 641 GAAG CTGCTCAAAGACAGCGACAAG-ATCCCTTTCATTCGAGAGGCGGAGCGGCTGC 696 

I I I I II III I I I I I I I I I I I I I II III I I I I I I I I I 

Sbjct 1981 GAAGTCCATGACCAA — CCAG-GAGAAGCAGCCCTACTATGAG-GAACAGGCGCGGCTGA 2036 

Query 697 GCCTCAAGCACATGGCTGACTACCCCGACTACAAGTAC 734 

III I I I I I III I II II I I I I I I I I I I I I 
Sbjct 2037 GCCGGCAGCACCTGGAGAAGTATCCTGACTACAAGTAC 2074 



>ref |XM_001713691.1| PREDICTED: Homo sapiens hypothetical protein LOC100130809 (LOC10013 

mRNA 

Length=897 



Score = 64.4 bits (70), Expect = 7e-07 
Identities = 94/126 (74%), Gaps = 8/126 (6%) 
Strand=Plus/Plus 

AAGCGACCCATGAACGCCTTCATGGTGTGGTCGC-AGATCGAGCGGCGCAAGAT CAT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
AAGCGACCCATGAACGCATTCATCGTGTGGTCTCGCGATC-AGAGGCGCAAGATGGCTCT 



Query 


528 


Sbjct 


329 


Query 


584 


Sbjct 


388 


Query 


644 


Sbjct 


445 



>ref |NM_003140.1| Homo sapiens sex determining region Y (SRY), mRNA 

Length=897 



Score = 64.4 bits (70), Expect = 7e-07 
Identities = 94/126 (74%), Gaps = 8/126 (6%) 
Strand=Plus/Plus 

AAGCGACCCATGAACGCCTTCATGGTGTGGTCGC-AGATCGAGCGGCGCAAGAT CAT 583 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 
AAGCGACCCATGAACGCATTCATCGTGTGGTCTCGCGATC-AGAGGCGCAAGATGGCTCT 387 

GGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAA 643 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
AGAG-AATC-CCAGA-ATGCGAAACTCAGAGATCAGCAAGCAGCTGGGATACCAGTGGAA 444 



Query 


528 


Sbjct 


329 


Query 


584 


Sbjct 


388 


Query 


644 


Sbjct 


445 



>ref |NM_152989.2| Homo sapiens SRY (sex determining region Y)-box 5 (S0X5), transcri 

variant 2, mRNA 

Length=4563 

GENE ID: 6660 S0X5 I SRY (sex determining region Y)-box 5 [Homo sapiens] 
(Over 10 PubMed links) 



Score = 57.2 bits (62), Expect = le-04 
Identities = 76/103 (73%), Gaps = 2/103 (1%) 
Strand=Plus/Plus 



)://blast.ncbi.nlm.nih.gov/Blast.( 



7/16/08 



NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein 
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Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTC-GCAGATCGAGCGGCGCAAGAT 580 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I 
Sbjct 1995 CACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGAT-GAACGGAGAAAGAT 2053 

Query 581 CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAG 623 

II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 2054 CCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAG 2096 



>ref |NM_006940.4| ^3^M5l Homo sapiens SRY (sex determining region Y)-box 5 (S0X5), transcri 

variant 1, mRNA 

Length=4333 

GENE ID: 6660 S0X5 I SRY (sex determining region Y)-box 5 [Homo sapiens] 
(Over 10 PubMed links) 



Score = 57.2 bits (62), Expect = le-04 
Identities = 76/103 (73%), Gaps = 2/103 (1%) 
Strand=Plus/Plus 

Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTC-GCAGATCGAGCGGCGCAAGAT 580 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I 
Sbjct 1765 CACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGAT-GAACGGAGAAAGAT 1823 

Query 581 CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAG 623 

II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1824 CCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAG 1866 



>ref |NM_178010.1| KSHSI Homo sapiens SRY (sex determining region Y)-box 5 (S0X5), transcri 

variant 3, mRNA 

Length=3095 

GENE ID: 6660 S0X5 I SRY (sex determining region Y)-box 5 [Homo sapiens] 
(Over 10 PubMed links) 



Score = 57.2 bits (62), Expect = le-04 
Identities = 76/103 (73%), Gaps = 2/103 (1%) 
Strand=Plus/Plus 

Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTC-GCAGATCGAGCGGCGCAAGAT 580 

I I I I I I I I I I II I II II I I I I I M I I I I I I I I I I II I II III I I II II 
Sbjct 527 CACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGAT-GAACGGAGAAAGAT 585 

Query 581 CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAG 623 

II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 586 CCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAG 628 



>ref |NM_052876.2 I HiE^^Sl Homo sapiens BTB (POZ) domain containing 14B (BTBD14B) , mRNA 
Length=4556 

GENE ID: 112939 BTBD14B I BTB (POZ) domain containing 14B [Homo sapiens] 
(10 or fewer PubMed links) 



Score =51.8 bits (56), Expect = 0.004 
Identities = 34/38 (89%), Gaps = 0/38 (0%) 
St rand=Plus /Minus 

Query 1149 GCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Sbjct 67 GCCTCGGCCTCCGCCTCCGCGGCCTCCGCGGCCTCCGC 30 



>ref |NM_033326.2 I Homo sapiens SRY (sex determining region Y)-box 6 (S0X6), transcri 

variant 2, mRNA 

Length=5048 

GENE ID: 55553 S0X6 I SRY (sex determining region Y)-box 6 [Homo sapiens] 
(Over 10 PubMed links) 



Score =50.0 bits (54), Expect = 0.014 
Identities = 65/90 (72%), Gaps = 0/90 (0%) 
Strand=Plus/Plus 
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Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 

I I I I I I I I I I I I I I I I I I II I I I I II I I III I II III II I II III 
Sbjct 1859 CACATTAAGCGACCAATGAATGCATTCATGGTTTGGGCAAAGGATGAGAGGAGAAAAATC 1918 

Query 582 ATGGAGCAGTCGCCCGACATGCACAACGCC 611 

I II I I I I I I I I I I I I I M I I 
Sbjct 1919 CTTCAGGCCTTCCCCGACATGCATAACTCC 1948 



>ref I NM 017508 . 1 | Homo sapiens SRY (sex determining region Y)-box 6 (S0X6), transcript 

variant 1, mRNA 

Length=5158 

GENE ID: 55553 S0X6 I SRY (sex determining region Y)-box 6 [Homo sapiens] 
(Over 10 PubMed links) 



Score = 50.0 bits (54), Expect = 0.014 
Identities = 65/90 (72%), Gaps = 0/90 (0%) 
Strand=Plus/Plus 

Query 52 2 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 
I I I I I I I I I M M I I I I I II I I I I I I I I III I II III II I II III 
Sbjct 1969 CACATTAAGCGACCAATGAATGCATTCATGGTTTGGGCAAAGGATGAGAGGAGAAAAATC 

Query 582 ATGGAGCAGTCGCCCGACATGCACAACGCC 611 

I II I I I I I I I I I I I I I I I I I 
Sbjct 2029 CTTCAGGCCTTCCCCGACATGCATAACTCC 2058 



>ref |XM_001132155.2| IMH PREDICTED: Homo sapiens similar to Afadin (Protein AF-6) (LOC730031 
mRNA 

Length=1522 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 31/35 (88%), Gaps = 0/35 (0%) 
Strand=Plus /Minus 

Query 1155 GCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 107 GCCTCCGCCTCCGCGGCTGCCTCCGCCTCCGCGGC 73 



>ref |NM_001040001 . 1 | Homo sapiens myeloid/lymphoid or mixed-lineage leukemia (tritho 

homolog, Drosophila) ; translocated to, 4 (MLLT4), transcript 
variant 1, mRNA 
Length=7762 

GENE ID: 4301 MLLT4 I myeloid/lymphoid or mixed-lineage leukemia (trithorax 
homolog, Drosophila) ; translocated to, 4 [Homo sapiens] (Over 10 PubMed links) 



Score =46.4 bits (50), Expect =0.18 
Identities = 31/35 (88%), Gaps = 0/35 (0%) 
Strand=Plus /Minus 

Query 1155 GCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 38 GCCTCCGCCTCCGCGGCTGCCTCCGCCTCCGCGGC 4 



>ref I NM_001040000 . 1 I lllX-^^^S Homo sapiens myeloid/lymphoid or mixed-lineage leukemia (tritho 
homolog, Drosophila); translocated to, 4 (MLLT4), transcript 
variant 2, mRNA 
Length=7551 

GENE ID: 4301 MLLT4 I myeloid/lymphoid or mixed-lineage leukemia (trithorax 
homolog, Drosophila) ; translocated to, 4 [Homo sapiens] (Over 10 PubMed links) 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 31/35 (88%), Gaps = 0/35 (0%) 
Strand=Plus /Minus 

Query 1155 GCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 38 GCCTCCGCCTCCGCGGCTGCCTCCGCCTCCGCGGC 4 
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>ref I NM_005936 . 2 | Homo sapiens myeloid/lymphoid or mixed-lineage leukemia (trithorax 

homolog, Drosophila) ; translocated to, 4 (MLLT4), transcript 
variant 3, mRNA 
Length=4981 

GENE ID: 4301 MLLT4 I myeloid/lymphoid or mixed-lineage leukemia (trithorax 
homolog, Drosophila) ; translocated to, 4 [Homo sapiens] (Over 10 PubMed links) 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 31/35 (88%), Gaps = 0/35 (0%) 
Strand=Plus /Minus 

Query 1155 GCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 38 GCCTCCGCCTCCGCGGCTGCCTCCGCCTCCGCGGC 4 



>ref |NM_015156.2 1 HH^D Homo sapiens REST corepressor 1 (RCORl) , mRNA 
Length=5537 

GENE ID: 23186 RCORl I REST corepressor 1 [Homo sapiens] (Over 10 PubMed links) 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 35/42 (83%), Gaps = 0/42 (0%) 
Strand=Plus/Plus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 126 CGCCTCGGGCGCCGCCGCCTCCTCAGCCTCGGCCGCCGCCGC 167 



>ref I NT_007592 . 14 I Hs6_7749 S Homo sapiens chromosome 6 genomic contig, reference assembly 
Length=48945890 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 4 



Score = 5012 bits (5558), Expect =0.0 
Identities = 2789/2793 (99%), Gaps = 2/2793 (0%) 
Strand=Plus/Plus 

CCAGCATTCGAGAAACTCCTCTCTACTTTAGCACGGTCTCCAGACTCAGCCGAGAGACAG 6 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCAGCATTCGAGAAACTCCTCTCTACTTTAGCACGGTCTCCAGACTCAGCCGAGAGACAG 1245: 

CAAACTGc-:KK;qcaatqaqaaaacqaqaaaa-iqqqaaaa-iqaqacTCTCCAGCCTGGGAA 124 
I I I I I I I I I M M 1 I I M M M I I M M M I I ri M M I I M I I I I I I I I I I I I I I I I I I 
CAAACTGCAGCGCGGTGAGAGAGCGAGAGAGAGGGAGAGAGAGACTCTCCAGCCTGGGAA 1245: 

CTATAACTCCTCTGCGAGAGGCGGAGAACTCCTTCCCCAAATCTTTTGGGGACTTTTCTC 184 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTATAACTCCTCTGCGAGAGGCGGAGAACTCCTTCCCCAAATCTTTTGGGGACTTTTCTC 1245: 

TCTTTACCCACCTCCGCCCCTGCGAGGAGTTGAGGGGCCAGTTCGGCCGCCGCGCGCGTC 244 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCTTTACCCACCTCCGCCCCTGCGAGGAGTTGAGGGGCCAGTTCGGCCGCCGCGCGCGTC 1245: 

TTCCCGTTCGGCGTGTGCTTGGCCCGGGGAACCGGGAGGGCCCGGCGATCGCGCGGCGGC 304 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TTCCCGTTCGGCGTGTGCTTGGCCCGGGGAACCGGGAGGGCCCGGCGATCGCGCGGCGGC 1245: 

CGCCGCGAGGGTGTGAGCGCGCGTGGGCGCCCGCCGAGCCGAGGCCATGGTGCAGCAAAC 364 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGCCGCGAGGGTGTGAGCGCGCGTGGGCGCCCGCCGAGCCGAGGCCATGGTGCAGCAAAC 1245: 

CAACAATGCCGAGAACACGGAAGCGCTGCTGGCCGGCGAGAGCTCGGACTCGGGCGCCGG 424 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAACAATGCCGAGAACACGGAAGCGCTGCTGGCCGGCGAGAGCTCGGACTCGGGCGCCGG 1245: 

CCTCGAGCTGGGAATCGCCTCCTCCCCCACGCCCGGCTCCACCGCCTCCACGGGCGGCAA 484 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCTCGAGCTGGGAATCGCCTCCTCCCCCACGCCCGGCTCCACCGCCTCCACGGGCGGCAA 12 45: 

GGCCGACGACCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGC 544 



Query 


5 


Sbjct 


12452671 


Query 


65 


Sbjct 


12452731 


Query 


125 


Sbjct 


12452791 


Query 


185 


Sbjct 


12452851 


Query 


245 


Sbjct 


12452911 


Query 


305 


Sbjct 


12452971 


Query 


365 


Sbjct 


12453031 


Query 


425 


Sbjct 


12453091 


Query 


485 
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12453151 




545 


sbjit 


12453211 




605 


sbjit 


12453271 




665 


sbjit 


12453331 




725 


sbjit 


12453391 




785 


sbjit 


12453451 




845 


sbjit 


12453511 




905 


sbjit 


12453571 




965 


sbjit 


12453631 




1025 


sbjit 


12453691 




1085 


sbjit 


12453751 




1145 


sbjit 


12453811 




1205 


sbjit 


12453871 




1265 


sbjit 


12453931 




1325 


sbjit 


12453991 




1385 


Sbjct 


12454051 




1445 


sbjit 


12454111 




1505 


sbjit 


12454171 




1565 


sbjit 


12454231 




1625 


Sbjct 


12454291 






Sbjct 


12454351 



CTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCA 

CAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAA 

GATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGA 

CTACAAGTACCGGCCCAGGAAGAAGGTGAAGTCCGGCAACGCCAACTCCAGCTCCTCGGC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTACAAGTACCGGCCCAGGAAGAAGGTGAAGTCCGGCAACGCCAACTCCAGCTCCTCGGC 

CGCCGCCTCCTCCAAGCCGGGGGAGoo.^ - -.v^.^ - v. ^ ^ . ; ; t . oo - 
I I I I I I I I I I I I I I I I I I I I I I I I I I I 11 1 I I I I I I 11 I I ri I 11 I I I I I M I I I I ]■ I I I 
CGCCGCCTCCTCCAAGCCGGGGGAGAAGGGAGACAAGGTCGGTGGCAGTGGCGGGGGCGG 

TTT i7 11'tTTTT i11l7TTT f ItTTTT I Ti^'niT i7 11'tTTT fT T tTTT 1 1 

CCATGGGGGCGGCGGCGGCGGCGGGAGCAGCAACGCGGGGGGAGGAGGCGGCGGTGCGAG 

• -! ^ .-o.^ ■! ^CCAACTCCAAACCGGCGCAGAAAAAGAGCTGCGGCTCCAAAGTGGCGGG 
I 1 r I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGGCGGCGGCGCCAACTCCAAACCGGCGCAGAAAAAGAGCTGCGGCTCCAAAGTGGCGGG 

CGGCGCGGGCGGTGGGGTTAGCAAACCGCACGCCAAGCTCATCCTGGCAqqcqqC'-SGcqg 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I' I I I I I i I I I I 
CGGCGCGGGCGGTGGGGTTAGCAAACCGCACGCCAAGCTCATCCTGGCAGGCGGCGGCGG 

cqqccsaGAAAGCAGCGGCTqccaccqccacctcct: tcaccqccaaac-sqqcaaagqccqc 
I ri I 11 I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I 1 I I ]■ I I I I I I I ]■ I I I f I I I I I M 
CGGCGGGAAAGCAGCGGCTGCCGCCGCCGCCTCCTTCGCCGCCGAACAGGCGGGGGCCGC 

cqccctqctqcccctqqqcaccqccqcC'-sACCACCACTCGCTGTACAAGGCGCGGACTCC 
I I I I I I I I 11 I I I I I I 'I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGCCCTGCTGCCCCTGGGCGCCGCCGCCGACCACCACTCGCTGTACAAGGCGCGGACTCC 

CAGCGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGCGCTCGCGGCCCCGGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAGCGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGCGCTCGCGGCCCCGGG 

CAAGCACCTGGCGGAGAAGAAGGTGAAGCGCGTCTACCTGTTCGGCGGCCTGGGCACGTC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAAGCACCTGGCGGAGAAGAAGGTGAAGCGCGTCTACCTGTTCGGCGGCCTGGGCACGTC 

GTCGTCGCCCGTGGGCGGCGTGGGCGCGGGAGCCGACCCCAGCGACCCCCTGGGCCTGTA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTCGTCGCCCGTGGGCGGCGTGGGCGCGGGAGCCGACCCCAGCGACCCCCTGGGCCTGTA 

CGAGGAGGAGGGCGCGGGCTGCTcqccC'-3;:;cqcqccc;:;qcct:.QaGcqqccac;:;qcaqcac 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I [ I I I' I I I I [ I 
CGAGGAGGAGGGCGCGGGCTGCTCGCCCGACGCGCCCAGCCTGAGCGGCCGCAGCAGCGC 

TT i' T I'TtTTi i' T TtTTTi I'ftTTTT i i? TTTTTi fi TTTTT f I'TTTTt i f T TTTTT i f I'T 

CGCCTCGTCCCCCGCCGCCGGCCGCTCGCCCGCCGACCACCGCGGCTACGCCAGCCTGCG 

CGCcqcctcacccqccccqtccaacacqcccTCGCACGCGtcctcctcaqcc;:.catccca 
I I' I I 'I I I I I I' I I I I I I I I T I I I I i I I' I I I I I I I I I I I I I I I I I I I I I I I'l' I I I I [ I I I I I 
CGCCGCCTCGCCCGCCCCGTCCAGCGCGCCCTCGCACGCGTCCTCCTCGGCCTCGTCCCA 

i' I'TT 1 1 i' 1 1' I TTT I i' T I TTTTT i ?T I'TT i" f 1 1 TTTTTTTTTTTTT 1 1 i'T'TTtTTT?t??I 

CTCCTCCTCTTCCTCCTCCTCGGGCTCCTCGTCCTCCGACGACGAGTTCGAAGACGACCT 

GCTCGACCTGAACCCCAGCTCAAACTTTGAGAGCATGTCCCTGGGCAGCTTCAGTTCGTC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCTCGACCTGAACCCCAGCTCAAACTTTGAGAGCATGTCCCTGGGCAGCTTCAGTTCGTC 

GTCGGCGCTCGACCGGGACCTGGATTTTAACTTCGAGCCCGGCTCCGGCTCGCACTTCGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTCGGCGCTCGACCGGGACCTGGATTTTAACTTCGAGCCCGGCTCCGGCTCGCACTTCGA 

GTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGAGACTGGCTCGAGTC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGAGACTGGCTCGAGTC 



12453210 
604 

12453270 
664 

12453330 
724 

12453390 
784 

12453450 
844 

12453510 
904 



12453630 
1024 

12453690 
1084 

12453750 
1144 

12453810 
1204 

12453870 
1264 

12453930 
1324 

12453990 
1384 

12454050 
1444 

12454110 
1504 

12454170 
1564 

12454230 
1624 

12454290 
1684 

12454350 
1744 

12454410 
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Query 


1745 


Sbjct 


12454411 


Query 


1805 


Sbjct 


12454471 


Query 


1865 


Sbjct 


12454531 


Query 


1924 


Sbjct 


12454591 


Query 


1984 


Sbjct 


12454651 


Query 


2044 


Sbjct 


12454711 


Query 


2103 


Sbjct 


12454771 


Query 


2163 


Sbjct 


12454831 


Query 


2223 


Sbjct 


12454891 


Query 


2283 


Sbjct 


12454951 


Query 


2343 


Sbjct 


12455011 


Query 


2403 


Sbjct 


12455071 


Query 


2463 


Sbjct 


12455131 


Query 


2523 


Sbjct 


12455191 


Query 


2583 


Sbjct 


12455251 


Query 


2643 


Sbjct 


12455311 


Query 


2703 


Sbjct 


12455371 


Query 


2763 


Sbjct 


12455431 



CAGCATCTCCAACCTGGTTTTCACCTACTGAAGGGCGCGCAGGCAGGGAGAAGGGCCqoo- 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
CAGCATCTCCAACCTGGTTTTCACCTACTGAAGGGCGCGCAGGCAGGGAGAAGGGCCGGG 

(Kj;5aaTAGGAGAGGAc:;6 aaaa,;:-{:aaafcaaa.;:aaaaaa.;;« ccjaaa.uCic:;a caaaccia acjaat 
I I I I I I I I I I I I I I I I I I I I I I l l I I I I I I I I I I I I M I I I M I I I I M I I I M I M I 
GGGGGTAGGAGAGGAGAAAAAAAAAGTGAAAAAAAGAAACGAAAAGGACAGACGAAGAGT 

.v^^ aa . . >oa ^' . .-a - ^ ^CTCG-TTCGCCCGCGTTCT 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TTAAAGAGAAAAGGGAAAAAAGAAAGAAAAAGTAAGCAGGGCTGGCTTCGCCCGCGTTCT 

CGTCGTCGGATCAAGGAGCGCGGCGGCGTTTTGGACCCGCGCTCCCATCCCCCACCTTCC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGTCGTCGGATCAAGGAGCGCGGCGGCGTTTTGGACCCGCGCTCCCATCCCCCACCTTCC 

CGGGCCGGGGACCCACTCTGCCCAGCCGGAGGGACGCGGAGGAGGAAGAGGGTAGACAGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGGGCCGGGGACCCACTCTGCCCAGCCGGAGGGACGCGGAGGAGGAAGAGGGTAGACAGG 

GGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGC- . . .aa . . GCGACTT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGCAAAAAAAAAAAAGCGACTT 

CGAGTTTGCTCCCCTTTGCTTGAAGAGACCCCCTCCCCCTTCCAACGAGCTTCCGGACTT 
I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGAGTTTGCTCCCCTTTGCTTGAAGAGACCCCCTCCCCCTTCCAACGAGCTTCCGGACTT 

GTCTGCACCCCCAGCAAGAAGGCGAGTTAGTTTTCTAGAGACTTGAAGGAGTCTCCCCCT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTCTGCACCCCCAGCAAGAAGGCGAGTTAGTTTTCTAGAGACTTGAAGGAGTCTCCCCCT 

TCCTGCATCACCACCTTGGTTTTGTTTTATTTTGCTTCTTGGTCAAGAAAGGAGGGGAGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCTGCATCACCACCTTGGTTTTGTTTTATTTTGCTTCTTGGTCAAGAAAGGAGGGGAGA 

ACCCAGCGCACCCCTccccccct;: ;: " tttAAACGCGTGATGAAGACAGAAGGCTCCGGGG 
I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ACCCAGCGCACCCCTCCCCCCCTTTTTTTAAACGCGTGATGAAGACAGAAGGCTCCGGGG 

TGACGAATTTGGCCGATGGCAGATGTTTTGGGGGAACGCCGGGACTGAGAGACTCCACGC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGACGAATTTGGCCGATGGCAGATGTTTTGGGGGAACGCCGGGACTGAGAGACTCCACGC 



AGGCGAATTCCCGTTTGGGGCCv- 



:actt; ■: ;: CCTCCCTCTTTTCCCCTTGCCCCCTCTGCAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGGCGAATTCCCGTTTGGGGCTTTTTTTTCCTCCCTCTTTTCCCCTTGCCCCCTCTGCAG 

CCGGAGGAGGAGATGTTGAGGGGAGGAGGCCAGCCAGTGTGACCGGCGCTAGGAAATGAC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCGGAGGAGGAGATGTTGAGGGGAGGAGGCCAGCCAGTGTGACCGGCGCTAGGAAATGAC 

CCGAGAACCCCGTTGGAAGCGCAGCAGCGGGAGCTAGGGGCGGGGGCGGAGGAGGACACG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCGAGAACCCCGTTGGAAGCGCAGCAGCGGGAGCTAGGGGCGGGGGCGGAGGAGGACACG 

AACTGGAAGGGGGTTCACGGTCAAACTGAAATGGATTTGCACGTTGGGGAGCTGGCGGCG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AACTGGAAGGGGGTTCACGGTCAAACTGAAATGGATTTGCACGTTGGGGAGCTGGCGGCG 

GCGGCTGCTGGGCCTCCGCCTTCTTTTCTACGTGAAATCAGTGAGGTGAGACTTCCCAGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GCGGCTGCTGGGCCTCCGCCTTCTTTTCTACGTGAAATCAGTGAGGTGAGACTTCCCAGA 

CCCCGGAGGCGTGGAGGAGAGGAGACTGTTTGATGTGGTACAGGGGCAGTCAGTGGAGGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCCCGGAGGCGTGGAGGAGAGGAGACTGTTTGATGTGGTACAGGGGCAGTCAGTGGAGGG 

CGAGTGGTTTCGG G 2795 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGAGTGGTTTCGGAAAAAAAAAAAGAAAAAAAG 12455463 



Page 21 of 75 

1804 

12454470 
1864 

12454530 
1923 

12454590 
1983 

12454650 
2043 

12454710 
2102 

12454770 
2162 

12454830 
2222 

12454890 
2282 

12454950 
2342 

12455010 
2402 

12455070 
2462 

12455130 
2522 

12455190 
2582 

12455250 
2642 

12455310 
2702 

12455370 
2762 

12455430 



Features in this part of subject sequence: 

butyrophilin, subfamily 2, member Al isoform 1 precursor 
butyrophilin, subfamily 2, member Al isoform 2 precursor 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 30/35 (85%), Gaps = 0/35 (0%) 
Strand=Plus/Plus 

Query 2 048 ACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2 082 



)://blast.ncbi.nlm.nih.gov/Blast.( 



7/16/08 



NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein Page 22 of 75 



Features flanking this part of subject sequence: 
12955 bp at 5' side: Rh-associated glycoprotein 
42967 bp at 3' side: cysteine-rich secretory protein 2 

Score =41.0 bits (44), Expect =7.5 
Identities = 33/40 (82%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGC>::>::?iaaaaaa>::a 2095 

I I I I I I I I III I I I I I I I I I I I I I III I I I I II 
Sbjct 40475750 TTGTTGTTGTTGCTGTTGTTGTTGTTAGGAAACAAAACAA 40475711 



>ref I NW_001838973 . 1 1 Hs6_WGA366_36 S Homo sapiens chromosome 6 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188126 ) 

Length=20334030 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 4 



Score = 1570 bits (1740), Expect =0.0 
Identities = 870/870 (100%), Gaps = 0/870 (0%) 
Strand=Plus/Plus 



Query 


5 


Sbjct 


17883264 


Query 


65 


Sbjct 


17883324 


Query 


125 


Sbjct 


17883384 


Query 


185 


Sbjct 


17883444 


Query 


245 


Sbjct 


17883504 


Query 


305 


Sbjct 


17883564 


Query 


365 


Sbjct 


17883624 


Query 


425 


Sbjct 


17883684 


Query 


485 


Sbjct 


17883744 


Query 


545 


Sbjct 


17883804 


Query 


605 


Sbjct 


17883864 


Query 


665 


Sbjct 


17883924 



CCAGCATTCGAGAAACTCCTCTCTACTTTAGCACGGTCTCCAGACTCAGCCGAGAGACAG 6 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CCAGCATTCGAGAAACTCCTCTCTACTTTAGCACGGTCTCCAGACTCAGCCGAGAGACAG 178 83323 

CAAACTG o.'O v, i^-.^o.^ -i i^-.^o.^ -i ^-.^o.-i -i i^-.^oTCTCCAGCCTGGGAA 124 
I I I I I I I I I j I I I 'lM I i I I 11 I i I i I I 11 I i I i j I 11 I i I i I I I I I I I I I I I I I I I I I I I 

CAAACTGCAGCGCGGTGAGAGAGCGAGAGAGAGGGAGAGAGAGACTCTCCAGCCTGGGAA 17883383 

CTATAACTCCTCTGCGAGAGGCGGAGAACTCCTTCCCCAAATCTTTTGGGGACTTTTCTC 184 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTATAACTCCTCTGCGAGAGGCGGAGAACTCCTTCCCCAAATCTTTTGGGGACTTTTCTC 17883443 

TCTTTACCCACCTCCGCCCCTGCGAGGAGTTGAGGGGCCAGTTCGGCCGCCGCGCGCGTC 244 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TCTTTACCCACCTCCGCCCCTGCGAGGAGTTGAGGGGCCAGTTCGGCCGCCGCGCGCGTC 17883503 

TTCCCGTTCGGCGTGTGCTTGGCCCGGGGAACCGGGAGGGCCCGGCGATCGCGCGGCGGC 304 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TTCCCGTTCGGCGTGTGCTTGGCCCGGGGAACCGGGAGGGCCCGGCGATCGCGCGGCGGC 17883563 

CGCCGCGAGGGTGTGAGCGCGCGTGGGCGCCCGCCGAGCCGAGGCCATGGTGCAGCAAAC 364 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGCCGCGAGGGTGTGAGCGCGCGTGGGCGCCCGCCGAGCCGAGGCCATGGTGCAGCAAAC 17883623 

CAACAATGCCGAGAACACGGAAGCGCTGCTGGCCGGCGAGAGCTCGGACTCGGGCGCCGG 424 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CAACAATGCCGAGAACACGGAAGCGCTGCTGGCCGGCGAGAGCTCGGACTCGGGCGCCGG 17883683 

CCTCGAGCTGGGAATCGCCTCCTCCCCCACGCCCGGCTCCACCGCCTCCACGGGCGGCAA 484 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CCTCGAGCTGGGAATCGCCTCCTCCCCCACGCCCGGCTCCACCGCCTCCACGGGCGGCAA 17883743 

GGCCGACGACCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGC 544 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GGCCGACGACCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGC 17883803 

CTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCA 604 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCA 17883863 

CAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAA 664 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAA 17883923 

GATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGA 724 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGA 17883983 



)://blast.ncbi.nlm.nih.gov/Blast.( 
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NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein Page 23 of 75 

Query 725 CTACAAGTACCGGCCCAGGAAGAAGGTGAAGTCCGGCAACGCCAACTCCAGCTCCTCGGC 784 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 17883984 CTACAAGTACCGGCCCAGGAAGAAGGTGAAGTCCGGCAACGCCAACTCCAGCTCCTCGGC 17884043 

Query 785 CGCCGCCTCCTCCAAGCCGGGGGAGAAGGGAGACAAGGTCGGTGGCAGTGGCGGGGGCGG 844 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 17884044 CGCCGCCTCCTCCAAGCCGGGGGAGAAGGGAGACAAGGTCGGTGGCAGTGGCGGGGGCGG 17884103 



Query 
Sbjct 



CCATGGGGGCGGCGGCGGCGGCGGGAGCAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCATGGGGGCGGCGGCGGCGGCGGGAGCAG 



Features in this part of subject sequence: 
SRY (sex determining region Y)-box 4 



Score = 1487 bits (1648), Expect = 0.0 
Identities = 829/831 (99%), Gaps = 1/831 (0%) 
Strand=Plus/Plus 





1966 


Sbjct 


17884827 




2026 


Sbjct 


17884887 




2086 


Sbjct 


17884947 






Sbjct 


17885007 




2205 


Sbjct 


17885067 




2265 


Sbjct 


17885127 




2325 


Sbjct 


17885187 




2385 


Sbjct 


17885247 






Sbjct 


17885307 


Query 


2505 


Sbjct 


17885367 


Query 


2565 


Sbjct 


17885427 


Query 


2625 


Sbjct 


17885487 


Query 


2685 


Sbjct 


17885547 


Query 


2745 


Sbjct 


17885607 



TCCCATCCCCCACCTTCCCGGGCCGGGGACCCACTCTGCCCAGCCGGAGGGACGCGGAGG 2025 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCCCATCCCCCACCTTCCCGGGCCGGGGACCCACTCTGCCCAGCCGGAGGGACGCGGAGG 17884886 

AGGAAGAGGGTAGACAGGGGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGCa 2085 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGGAAGAGGGTAGACAGGGGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGCA 17884946 

a aaaaa>::?i a a-GCGACTTCGAGTTTGCTCCCCTTTGCTTGAAGAGACCCCCTCCCCCTTC 2144 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAAAAAAAAAAGCGACTTCGAGTTTGCTCCCCTTTGCTTGAAGAGACCCCCTCCCCCTTC 17885006 

CAACGAGCTTCCGGACTTGTCTGCACCCCCAGCAAGAAGGCGAGTTAGTTTTCTAGAGAC 220 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CAACGAGCTTCCGGACTTGTCTGCACCCCCAGCAAGAAGGCGAGTTAGTTTTCTAGAGAC 17885066 

TTGAAGGAGTCTCCCCCTTCCTGCATCACCACCTTGGTTTTGTTTTATTTTGCTTCTTGG 226 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TTGAAGGAGTCTCCCCCTTCCTGCATCACCACCTTGGTTTTGTTTTATTTTGCTTCTTGG 17885126 

TCAAGAAAGGAGGGGAGAACCCAGCGCACCCCTcccccccttttfcfctAAACGCGTGATGA 2324 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCAAGAAAGGAGGGGAGAACCCAGCGCACCCCTCCCCCCCTTTTTTTAAACGCGTGATGA 17885186 

AGACAGAAGGCTCCGGGGTGACGAATTTGGCCGATGGCAGATGTTTTGGGGGAACGCCGG 238 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGACAGAAGGCTCCGGGGTGACGAATTTGGCCGATGGCAGATGTTTTGGGGGAACGCCGG 178 85246 

GACTGAGAGACTCCACGCAGGCGAATTCCCGTTTGGGGCCttttttrCCTCCCTCTTTTC 2444 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GACTGAGAGACTCCACGCAGGCGAATTCCCGTTTGGGGCTTTTTTTTCCTCCCTCTTTTC 178 85306 

CCCTTGCCCCCTCTGCAGCCGGAGGAGGAGATGTTGAGGGGAGGAGGCCAGCCAGTGTGA 250 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCCTTGCCCCCTCTGCAGCCGGAGGAGGAGATGTTGAGGGGAGGAGGCCAGCCAGTGTGA 17885366 

CCGGCGCTAGGAAATGACCCGAGAACCCCGTTGGAAGCGCAGCAGCGGGAGCTAGGGGCG 256 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCGGCGCTAGGAAATGACCCGAGAACCCCGTTGGAAGCGCAGCAGCGGGAGCTAGGGGCG 17885426 

GGGGCGGAGGAGGACACGAACTGGAAGGGGGTTCACGGTCAAACTGAAATGGATTTGCAC 262 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGGGCGGAGGAGGACACGAACTGGAAGGGGGTTCACGGTCAAACTGAAATGGATTTGCAC 17885486 

GTTGGGGAGCTGGCGGCGGCGGCTGCTGGGCCTCCGCCTTCTTTTCTACGTGAAATCAGT 268 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTTGGGGAGCTGGCGGCGGCGGCTGCTGGGCCTCCGCCTTCTTTTCTACGTGAAATCAGT 178 85546 

GAGGTGAGACTTCCCAGACCCCGGAGGCGTGGAGGAGAGGAGACTGTTTGATGTGGTACA 274 4 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GAGGTGAGACTTCCCAGACCCCGGAGGCGTGGAGGAGAGGAGACTGTTTGATGTGGTACA 17885606 

GGGGCAGTCAGTGGAGGGCGAGTGGTTTCGGas:.3aaaaaa-u.3qaaaaa-::.:iG 2795 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GGGGCAGTCAGTGGAGGGCGAGTGGTTTCGGAAAAAAAAAAAGAAAAAAAG 17885657 



>ref |NW_001838765. 1 |Hs2_WGA158_36 II 
(based on HuRef SCAF_1103279188139 ) 
Length=2797789 



i Homo sapiens chromosome 2 genomic contig, alternate asse 



http://blast.ncbi.nlm.nih.gov/Blast.cgi 



NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein 



this part of subject sequence: 



Page 24 of 75 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 



Score = 309 bits (342), Expect = le-80 
Identities = 224/259 (86%), Gaps = 0/259 (0%) 
Strand=Plus/Plus 

CGACCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGCCTTCAT 55 0 
I I I I I I I I I I I I I I I I I I II II I I I II I I I I I I II I I I I I I I I I I I I I 
CGACCCAGACTGGTGCAAGACGGCGTCGGGCCACATCAAGCGGCCGATGAACGCGTTCAT 71964 0 

GGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCACAACGC 610 
III I I I I I I I I I I I I II I I I II II II II I I I I I I I II I I I I I I I I I I I I I I 
GGTATGGTCCAAGATCGAACGCAGGAAGATCATGGAGCAGTCTCCGGACATGCACAACGC 719 70 0 

CGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAAGATCCC 6 7 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I 
CGAGATCTCCAAGAGGCTGGGCAAGCGCTGGAAAATGCTGAAGGACAGCGAGAAGATCCC 719 760 

TTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAA 73 0 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I 
GTTCATCCGGGAGGCGGAGCGGCTGCGGCTCAAGCACATGGCCGACTACCCCGACTACAA 71982 0 

GTACCGGCCCAGGAAGAAG 749 
I I I I I I I I I I I I I I III 
GTACCGGCCCCGGAAAAAG 719839 



Query 


491 


Sbjct 


719581 


Query 


551 


Sbjct 


719641 


Query 


611 


Sbjct 


719701 


Query 


671 


Sbjct 


719761 


Query 


731 


Sbjct 


719821 



Score = 131 bits (144), Expect = 5e-27 
Identities = 96/112 (85%), Gaps = 0/112 (0%) 
Strand=Plus/Plus 

GGCTCGCACTTCGAGTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I 
GGCTCCCACTTCGAGTTCCCCGACTACTGCACGCCGGAGCTGAGCGAGATGATCGCGGGG 

GACTGGCTCGAGTCCAGCATCTCCAACCTGGTTTTCACCTACTGAAGGGCGC 1782 
I I I I I I I I III I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
GACTGGCTGGAGGCGAACTTCTCCGACCTGGTGTTCACATATTGAAAGGCGC 720803 



Query 


1671 


Sbjct 


720692 


Query 


1731 


Sbjct 


720752 



>ref I NT_005334 . 15 I Hs2_5491 S Homo sapiens chromosome 2 genomic contig, reference assembly 
Length=11088087 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY-box 11 



Score = 309 bits (342), Expect = le-80 
Identities = 224/259 (86%), Gaps = 0/259 (0%) 
Strand=Plus/Plus 



Query 


491 


CGACCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGCCTTCAT 


550 


Sbjct 


667099 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II II M II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGACCCAGACTGGTGCAAGACGGCGTCGGGCCACATCAAGCGGCCGATGAACGCGTTCAT 


667158 


Query 


551 


GGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCACAACGC 


610 


Sbjct 


667159 


III 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GGTATGGTCCAAGATCGAACGCAGGAAGATCATGGAGCAGTCTCCGGACATGCACAACGC 


667218 


Query 


611 


CGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAAGATCCC 


670 


Sbjct 


667219 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
CGAGATCTCCAAGAGGCTGGGCAAGCGCTGGAAAATGCTGAAGGACAGCGAGAAGATCCC 


667278 


Query 
Sbjct 


671 

667279 


TTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAA 
1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
GTTCATCCGGGAGGCGGAGCGGCTGCGGCTCAAGCACATGGCCGACTACCCCGACTACAA 


730 

667338 


Query 


731 


GTACCGGCCCAGGAAGAAG 749 





)://blast.ncbi.nlm.nih.gov/Blast.( 
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NCBI Blast:X70683:H.sapiens mRNA for SOX-4 protein Page 25 of 75 

I I I I I I I I I I I I I I III 

Sbjct 667339 GTACCGGCCCCGGAAAAAG 667357 

Features in this part of subject sequence: 

SRY-box 11 

Score = 131 bits (144), Expect = 5e-27 
Identities = 96/112 (85%), Gaps = 0/112 (0%) 
Strand=Plus/Plus 

Query 1671 GGCTCGCACTTCGAGTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGA 1730 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I I I I I I I I I I I I I 
Sbjct 668210 GGCTCCCACTTCGAGTTCCCCGACTACTGCACGCCGGAGCTGAGCGAGATGATCGCGGGG 668269 

Query 1731 GACTGGCTCGAGTCCAGCATCTCCAACCTGGTTTTCACCTACTGAAGGGCGC 1782 

I I I I I I I I III I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Sbjct 668270 GACTGGCTGGAGGCGAACTTCTCCGACCTGGTGTTCACATATTGAAAGGCGC 668321 

>ref |NW_001838652.1|Hs20_WGA1211_36 ilfl Homo sapiens chromosome 20 genomic contig, alternate a 

(based on HuRef SCAF_1103279188366 ) 

Length=22415471 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 12 

Score = 291 bits (322), Expect = 3e-75 
Identities = 213/246 (86%), Gaps = 1/246 (0%) 
Strand=Plus/Plus 

CCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGCCTTCATGGT 553 

I I I I I II II I I I I I I I I I II II I II I II II II II I II I I I I I I I I II II I I I I 
CCCG-GCTGGTGCAAGACCCCGAGCGGCCACATC7VAGAGGCCGATGAACGCATTCATGGT 25813 9 

GTGGTCGCAGATCGAGCGGCGC7VAGATCATGGAGCAGTCGCCCGACATGCACAACGCCGA 613 
I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GTGGTCGCAGCACGAACGGCGGAAGATCATGGACCAGTGGCCCGACATGCACAACGCCGA 25819 9 

GATCTCCAAGCGGCTGGGCAAACGCTGG7VAGCTGCTCAAAGACAGCGACAAGATCCCTTT 6 73 
I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I III II I I I I I I I I II 
GATCTCCAAGCGCCTGGGCCGCCGCTGGCAGCTGCTGCAGGACTCGGAGAAGATCCCGTT 258259 

CATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTA 733 
I I II I I I I I I I II II II II II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
CGTGCGGGAGGCGGAGCGGCTGCGGCTCAAGCACATGGCGGATTACCCGGACTACAAGTA 258319 

CCGGCC 739 
I I I I I I 

CCGGCC 258325 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 12 

Score = 131 bits (144), Expect = 5e-27 
Identities = 90/102 (88%), Gaps = 0/102 (0%) 
Strand=Plus/Plus 

Query 1674 TCGCACTTCGAGTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGAGAC 1733 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Sbjct 258840 TCGCACTTCGAGTTCCCGGACTACTGCACCCCCGAGGTTACCGAGATGATCGCGGGGGAC 258899 

Query 1734 TGGCTCGAGTCCAGCATCTCCAACCTGGTTTTCACCTACTGA 1775 

I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 258900 TGGCGCCCGTCTAGCATCGCAGACCTGGTTTTCACCTACTGA 258941 



Query 


494 


Sbjct 


258081 


Query 


554 


Sbjct 


258140 


Query 


614 


Sbjct 


258200 


Query 


674 


Sbjct 


258260 


Query 


734 


Sbjct 


258320 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 

Strand=Plus /Minus 
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Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 9967415 CGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 9967377 



Features in this part of subject sequence: 
similar to hCG2045825 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 9967421 CGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 9967383 



Features in this part of subject sequence: 
similar to hCG2045825 



Score =44.6 bits (48), Expect =0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 9967427 CGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 9967389 



Features in this part of subject sequence: 
similar to hCG2045825 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 9967433 CGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 9967395 



>ref I NT_011387 . 8 I Hs20_11544 ilS^ Homo sapiens chromosome 20 genomic contig, reference assembly 
Length=26259569 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 12 



Score = 291 bits (322), Expect = 3e-75 
Identities = 213/246 (86%), Gaps = 1/246 
Strand=Plus/Plus 



Query 


494 


Sbjct 


246656 


Query 


554 


Sbjct 


246715 


Query 


614 


Sbjct 


246775 


Query 


674 


Sbjct 


246835 


Query 


734 


Sbjct 


246895 



CCCGAGCTGGTGCAAGACCCCGAGTGGGCACATCAAGCGACCCATGAACGCCTTCATGGT 553 
I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I II I I I I I I I I I I I I I I I I 

CCCG-GCTGGTGCAAGACCCCGAGCGGCCACATCAAGAGGCCGATGAACGCATTCATGGT 2 46 714 

GTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCAGTCGCCCGACATGCACAACGCCGA 613 
I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GTGGTCGCAGCACGAACGGCGGAAGATCATGGACCAGTGGCCCGACATGCACAACGCCGA 2 4677 4 

GATCTCCAAGCGGCTGGGCAAACGCTGGAAGCTGCTCAAAGACAGCGACAAGATCCCTTT 6 73 
I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I III II I I I I I I I I II 

GATCTCCAAGCGCCTGGGCCGCCGCTGGCAGCTGCTGCAGGACTCGGAGAAGATCCCGTT 2 46 83 4 

CATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTA 733 
I I II I I I II II II II II II II I I I I I I I I I I I I I I II I I I I I I I I II II II I I 

CGTGCGGGAGGCGGAGCGGCTGCGGCTCAAGCACATGGCGGATTACCCGGACTACAAGTA 24689 4 



Features in this part of subject sequence: 
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SRY (sex determining region Y)-box 12 



Page 27 of 75 



Score = 131 bits (144), Expect = 5e-27 
Identities = 90/102 (88%), Gaps = 0/102 (0%) 
Strand=Plus/Plus 

Query 1674 TCGCACTTCGAGTTCCCGGACTACTGCACGCCCGAGGTGAGCGAGATGATCTCGGGAGAC 1733 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
Sbjct 247415 TCGCACTTCGAGTTCCCGGACTACTGCACCCCCGAGGTTACCGAGATGATCGCGGGGGAC 24747 4 

Query 1734 TGGCTCGAGTCCAGCATCTCCAACCTGGTTTTCACCTACTGA 1775 

I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 247475 TGGCGCCCGTCTAGCATCGCAGACCTGGTTTTCACCTACTGA 247516 



Features in this part of subject sequence: 
similar to hCG2045825 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 9955587 CGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 9955549 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 9955593 CGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 9955555 



Features in this part of subject sequence: 
similar to hCG2045825 



Score =44.6 bits (48), Expect =0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 9955599 CGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 9955561 



Features in this part of subject sequence: 
similar to hCG2045825 



Score =44.6 bits (48), Expect =0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 9955605 CGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 9955567 



>ref I NW_001838084 . 2 I Hsl3_WGA793_36 il^ Homo sapiens chromosome 13 genomic contig, alternate as 

(based on HuRef SCAF_1103279188152 ) 

Length=25468656 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

SRY-box 21 



Score = 140 bits (154), Expect = le-29 
Identities = 164/219 (74%), Gaps = 2/219 (0%) 
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Strand=Plus/Plus 

CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 
III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CACGTCAAGCGGCCCATGAACGCCTTCATGGTGTGGTCGCGGGCTCAGCGGCGCAAGATG 16996259 

ATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGG 641 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
GCCCAGGAGAACCCCAAGATGCACAACTCGGAGATCAGCAAGCGCTTGGGCGCCGAGTGG 16996319 

AAGCTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCT 700 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AAACTGCTCACAGAGTCGGAGAAGCGGCCGTTCA-TCGACGAGGCCAAGCGTCTACGCGC 16996378 

CAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 

II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
CATGCACATGAAGGAGCACCCCGACTACAAGTACCGGCC 16996417 



Query 


522 


Sbjct 


16996200 


Query 


582 


Sbjct 


16996260 


Query 


642 


Sbjct 


16996320 


Query 


701 


Sbjct 


16996379 



Features in this part of subject sequence: 
hypothetical protein 



Score =42.8 bits (46), Expect =2.2 
Identities = 28/31 (90%), Gaps = 0/31 (0%) 
Strand=Plus /Minus 

Query 2050 CTGTGATTGTTGTTATTGATGTTGTTGTTGA 2080 

I I I I I I I I I I I I I III I I I I I I I I I I I I 
Sbjct 1920740 CTGTGGTTGTTGTTGTTGTTGTTGTTGTTGA 1920710 



>ref I NT_009952 . 14 I Hsl3_10109 S Homo sapiens chromosome 13 genomic contig, reference assembly 
Length=25443670 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY-box 21 



Score = 140 bits (154), Expect = le-29 
Identities = 164/219 (74%), Gaps = 2/219 (0%) 
Strand=Plus /Minus 

CACATCAAGCGACCCATGTIACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 
III II II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CACGTCAAGCGGCCCATG7VACGCCTTCATGGTGTGGTCGCGGGCTCAGCGGCGCAAGATG 8 453902 

ATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGG 6 41 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
GCCCAGGAGAACCCCAAGATGCACAACTCGGAGATCAGCAAGCGCTTGGGCGCCGAGTGG 8 453 8 42 

AAGCTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCT 70 0 
I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

AAACTGCTCACAGAGTCGGAGAAGCGGCCGTTCAT-CGACGAGGCCAAGCGTCTACGCGC 8 453 7 83 



Query 


522 


Sbjct 


8453961 


Query 


582 


Sbjct 


8453901 


Query 


642 


Sbjct 


8453841 


Query 


701 


Sbjct 


8453782 



CAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 

II I I I I I I I II I I I I I I I I M M M I II I I I I I 
CATGCACATGAAGGAGCACCCCGACTACAAGTACCGGCC 



739 

8453744 



Features in this part of subject sequence: 
insulin receptor substrate 2 
hypothetical protein 



Score =42.8 bits (46), Expect =2.2 
Identities = 28/31 (90%), Gaps = 0/31 (0%) 
Strand=Plus/Plus 

Query 2 050 CTGTGATTGTTGTTATTGATGTTGTTGTTGA 2080 

I I I I I I I I I I I I I III I I I I I I I I I I I I 
Sbjct 23527988 CTGTGGTTGTTGTTGTTGTTGTTGTTGTTGA 23528018 



>ref I NW_001838403 . 1 1 Hsl7_WGA1112_36 M Homo sapiens chromosome 17 genomic contig, alternate a 

(based on HuRef SCAF_1103279188371 ) 

Length=15048748 



)://blast.ncbi.nlm.nih.gov/Blast.( 
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Page 29 of 75 



Score = 129 bits (142), Expect = 2e-26 
Identities = 158/213 (74%), Gaps = 2/213 (0%) 
Strand=Plus /Minus 

AAGCGACCCATGAACGCCTTCATGGTGTGGT-CGCAGATCGAGCGGCGCAAGATCATGGA 5 86 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I III I I I I I I 

AAGCGGCCGATGAACGCGTTCATGGTGTGGAGCTCCGCTC-AGCGCCGCCAGATGGCGCA 7 078121 

GCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGCT 6 46 
I I I I III I I I I I I I I I I I I I II II II II I I I I I I I I I I I I I I I I I I I I 

GCAGAACCCCAAGATGCACAACTCCGAGATCTCCAAGCGCCTGGGCGCGCAGTGGAAGCT 7 0 78061 

GCTCAAAGACAGCGACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCA 7 06 
III III I I I I I I I I I I I I I I I I I II II I I I I I I I II 

GCTGGACGAGGACGAGAAGCGGCCCTTCGTGGAGGAGGCCAAGCGGCTCCGCGCCCGACA 7078 001 

CATGGCTGACTACCCCGACTACAAGTACCGGCC 739 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCTGCGCGACTACCCCGACTACAAGTACCGGCC 7077968 



Query 


528 


Sbjct 


7078179 


Query 


587 


Sbjct 


7078120 


Query 


647 


Sbjct 


7078060 


Query 


707 


Sbjct 


7078000 



>ref I NT_010718 . 15 I Hsl7_10875 Ifi Homo sapiens chromosome 17 genomic contig, reference assembly 
Length=21163833 

Features in this part of subject sequence: 
SRY-box 15 



Score = 129 bits (142), Expect = 2e-26 
Identities = 158/213 (74%), Gaps = 2/213 (0%) 
Strand=Plus /Minus 

AAGCGACCCATGAACGCCTTCATGGTGTGGT-CGCAGATCGAGCGGCGCAAGATCATGGA 586 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I III I I I I I I 
AAGCGGCCGATGAACGCGTTCATGGTGTGGAGCTCCGCTC-AGCGCCGCCAGATGGCGCA 7 090137 



GCTCAAAGACAGCGACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAGCA 
III III I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

GCTGGACGAGGACGAGAAGCGGCCCTTCGTGGAGGAGGCCAAGCGGCTCCGCGCCCGACA 



Query 


528 


Sbjct 


7090195 


Query 


587 


Sbjct 


7090136 


Query 


647 


Sbjct 


7090076 


Query 


707 


Sbjct 


7090016 



CATGGCTGACTACCCCGACTACAAGTACCGGCC 
I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
CCTGCGCGACTACCCCGACTACAAGTACCGGCC 



>ref I NW_001838454 . 2 I Hsl7_WGA1163_36 Homo sapiens chromosome 17 genomic contig, alternate a 

(based on HuRef SCAF_1103279188168 ) 

Length=12595466 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
transcription factor S0X9 



Score = 104 bits (114), Expect = 8e-19 
Identities = 96/122 (78%), Gaps = 0/122 (0%) 
Strand=Plus /Minus 

GCACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I III I 
GCACGTCAAGCGGCCCATGAACGCCTTCATGGTGTGGGCGCAGGCGGCGCGCAGGAAGCT 

CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTG 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGCGGACCAGTACCCGCACTTGCACAACGCCGAGCTCAGCAAGACGCTGGGCAAGCTCTG 



GA 5396313 



Query 


521 


Sbjct 


5396434 


Query 


581 


Sbjct 


5396374 


Query 


641 


Sbjct 


5396314 
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Features in this part of subject sequence: 
transcription factor S0X9 



Score = 59.0 bits (64), Expect = 3e-05 
Identities = 70/95 (73%), Gaps = 0/95 (0%) 
Strand=Plus /Minus 

Query 645 CTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAG 70 4 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Sbjct 5395414 CTTCTGAACGAGAGCGAGAAGCGGCCCTTCGTGGAGGAGGCGGAGCGGCTGCGCGTGCAG 5395355 

Query 705 CACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 

I I I I I III I I II II I I I I I I I I I I III 
Sbjct 5395354 CACAAGAAGGACCACCCGGATTACAAGTACCAGCC 5395320 



Features in this part of subject sequence: 

regulator of G-protein signaling 9 isoform 2 
regulator of G-protein signaling 9 isoform 1 



Score =42.8 bits (46), Expect =2.2 
Identities = 28/31 (90%), Gaps = 0/31 (0%) 
Strand=Plus/Plus 

Query 2050 CTGTGATTGTTGTTATTGATGTTGTTGTTGA 2080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 12330673 CTGTTGTTGTTGTTATTGCTGTTGTTGTTGA 12330703 



Features flanking this part of subject sequence: 

13946 bp at 5' side: voltage-dependent calcium channel gamma-5 subunit isoform b 
59295 bp at 3' side: protein kinase C, alpha 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus /Minus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

III I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 10658531 TGTTATTGTTGTTGTTGCTGTTGTTGTTGTTG 10658500 



>ref |NW_001838884.2|Hs3_WGA277_36 1^ Homo sapiens chromosome 3 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188385) 

Length=64955803 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

SRY-box 14 



Score = 104 bits (114), Expect = 8e-19 
Identities = 161/224 (71%), Gaps = 12/224 (5% 
Strand=Plus/Minus 



Query 


522 


Sbjct 


56570764 


Query 


582 


Sbjct 


56570704 


Query 


639 


Sbjct 


56570647 


Query 


696 


Sbjct 


56570590 



CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CACATCAAGCGGCCCATGAACGCCTTCATGGTATGGTCCCGGGGCCAGCGGCGCAAGATG 56570705 

ATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGG CAAACGC 63 8 

II I III I I I II I I I I I I Mill I I I I I I I I I I III 
GCCCAGGAAAACCCCAAGATGCACAACTCGGAGATCAGCAAACGCCTAGGTGCCGAA 56570648 

TGGAAGCTGC — TCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTG 695 
I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
TGGAAGCTTCTGTCCGAGGCA — GAGAAGCGGCCATACA-TCGATGAAGCCAAGCGGCTA 56570591 

CGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 
III I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
CGCGCCCAGCACATGAAGGAGCACCCTGACTACAAGTACCGACC 56570547 



Features in this part of subject sequence: 
sex-determining region Y-box 2 



)://blast.ncbi.nlm.nih.gov/Blast.( 
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Score = 91.5 bits (100), Expect = 5e-15 
Identities = 153/219 (69%), Gaps = 2/219 (0%) 
Strand=Plus /Minus 



Page 31 of 75 



Query 


526 


Sbjct 


12593937 


Query 


586 


Sbjct 


12593877 


Query 


646 


Sbjct 


12593817 


Query 


705 


Sbjct 


12593758 



TCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TCAAGCGGCCCATGAATGCCTTCATGGTGTGGTCCCGCGGGCAGCGGCGCAAGATGGCCC 

AGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AGGAGAACCCCAAGATGCACAACTCGGAGATCAGCAAGCGCCTGGGCGCCGAGTGGAAAC 

TGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCTCAAG 
I I III I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

TTTTGTCGGAGACGGAGAAGCGGCCGTTCA-TCGACGAGGCTAAGCGGCTGCGAGCGCTG 

CACATGGCTGACTACCCCGACTACAAGTACCGGCCCAGG 7 43 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
CACATGAAGGAGCACCCGGATTATAAATACCGGCCCCGG 12593720 



585 

12593878 



Features flanking this part of subject sequence: 

368249 bp at 5' side: procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 isofor... 
1716439 bp at 3' side: hypothetical protein LOC205428 



Score =46.4 bits (50), Expect =0.18 
Identities = 31/35 (88%), Gaps = 0/35 (0%) 
Strand=Plus/Plus 

Query 2228 CATCACCACCTTGGTTTTGTTTTATTTTGCTTCTT 2262 

I I I I I III I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 48631837 CATCATCACTTTGGTTATGTTTTATTTTGGTTCTT 48631871 



Features flanking this part of subject sequence: 

555396 bp at 5' side: similar to peptidylprolyl isomerase A-like 
680809 bp at 3 ' side: butyrylcholinesterase precursor 



Score =42.8 bits (46), Expect =2.2 
Identities = 26/28 (92%), Gaps = 0/28 (0%) 
Strand=Plus/Plus 

Query 2055 ATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 27830621 ATTGTTGTTGTTGTTGTTGTTGTTGATG 27830648 



Features flanking this part of subject sequence: 

8911 bp at 5' side: sterile alpha motif domain containing 7 
40774 bp at 3' side: leucine rich repeat containing 31 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 31/37 (83%), Gaps = 0/37 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAAAAAAA 2092 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 24430482 TTGTTGTTATTGTTGTTGTTGTTGTTCTGAAAAATAA 24430518 



Features flanking this part of subject sequence: 
360070 bp at 5' side: similar to hCG2021878 
1305561 bp at 3' side: hypothetical protein LOC131149 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 33/40 (82%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 180 TTCTCTCTTTACCCACCTCCGCCCCTGCGAGGAGTTGAGG 219 

I I I I I I I I I I I I I III III I I I I III I I I I I I I 
Sbjct 31502884 TTCTCTCTTTACACCACTCAGCCGCTGCCAGGGGTTGAGG 31502845 



>ref I NT_010641 . 15 I Hsl7_10798 Homo sapiens chromosome 17 genomic contig, reference assembly 
Length=11472733 

Sort alignments for this subject seq 
E value Score Percent identity 
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Query start position Subject star 

Features in this part of subject sequence: 
transcription factor S0X9 

Score = 104 bits (114), Expect = 8e-19 
Identities = 96/122 (78%), Gaps = 0/122 (0%) 
Strand=Plus/Plus 

GCACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT 580 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I III I 
GCACGTCAAGCGGCCCATGAACGCCTTCATGGTGTGGGCGCAGGCGGCGCGCAGGAAGCT 4044172 

CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTG 640 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGCGGACCAGTACCCGCACTTGCACAACGCCGAGCTCAGCAAGACGCTGGGCAAGCTCTG 4044232 

GA 642 
I I 

GA 4044234 



Query 


521 


Sbjct 


4044113 


Query 


581 


Sbjct 


4044173 


Query 


641 


Sbjct 


4044233 



Features in this part of subject sequence: 
transcription factor S0X9 



Score =59.0 bits (64), Expect = 3e-05 
Identities = 70/95 (73%), Gaps = 0/95 (0%) 
Strand=Plus/Plus 

Query 645 CTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGAGAGGCGGAGCGGCTGCGCCTCAAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Sbjct 4045133 CTTCTGAACGAGAGCGAGAAGCGGCCCTTCGTGGAGGAGGCGGAGCGGCTGCGCGTGCAG 

Query 705 CACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 

I I I I I III I I I I II I I I I I I I I I I III 
Sbjct 4045193 CACAAGAAGGACCACCCGGATTACAAGTACCAGCC 4045227 



>ref I NT_005612 . 15 I Hs3_5769 S Homo sapiens chromosome 3 genomic contig, reference assembly 
Length=100530253 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY-box 14 



Score = 104 bits (114), Expect = 8e-19 
Identities = 161/224 (71%), Gaps = 12/224 (5%) 

Strand=Plus/Plus 



Query 


522 


CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 


581 


Sbjct 


43978791 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 
CACATCAAGCGGCCCATGAACGCCTTCATGGTATGGTCCCGGGGCCAGCGGCGCAAGATG 


43978850 


Query 


582 


ATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGG CAAACGC 


638 


Sbjct 


43978851 


III 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II II II III 

GCCCAGGAAAACCCCAAGATGCACAACTCGGAGATCAGCAAACGCCTAGGTGCCGAA 


43978907 


Query 


639 


TGGAAGCTGC — TCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTG 


695 


Sbjct 


43978908 


1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 11 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TGGAAGCTTCTGTCCGAGGCA — GAGAAGCGGCCATACA-TCGATGAAGCCAAGCGGCTA 


43978964 


Query 


696 


CGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 
III 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 II 
CGCGCCCAGCACATGAAGGAGCACCCTGACTACAAGTACCGACC 43979008 




Sbjct 


43978965 





Features in this part of subject sequence: 
sex-determining region Y-box 2 



Score = 91.5 bits (100), Expect = 5e-15 
Identities = 153/219 (69%), Gaps = 2/219 (0%) 
Strand=Plus/Plus 

Query 526 TCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGG 585 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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TCAAGCGGCCCATGAATGCCTTCATGGTGTGGTCCCGCGGGCAGCGGCGCAAGATGGCCC 87925479 

AGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAGC 645 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

AGGAGAACCCCAAGATGCACAACTCGGAGATCAGCAAGCGCCTGGGCGCCGAGTGGAAAC 87925539 

TGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCTCAAG 704 
I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TTTTGTCGGAGACGGAGAAGCGGCCGTTCA-TCGACGAGGCTAAGCGGCTGCGAGCGCTG 87925598 

CACATGGCTGACTACCCCGACTACAAGTACCGGCCCAGG 743 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
CACATGAAGGAGCACCCGGATTATAAATACCGGCCCCGG 87925637 



Sbjct 


87925420 


Query 


586 


Sbjct 


87925480 


Query 


646 


Sbjct 


87925540 


Query 


705 


Sbjct 


87925599 



Features flanking this part of subject sequence: 
1479986 bp at 5' side: similar to PRO0419 

371407 bp at 3' side: procollagen-lysine, 2-oxoglutarate 5-dioxygenase 2 isofor... 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 31/35 (88%), Gaps = 0/35 (0%) 
Strand=Plus /Minus 

Query 2228 CATCACCACCTTGGTTTTGTTTTATTTTGCTTCTT 2262 

I I I I I III I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 51912243 CATCATCACTTTGGTTATGTTTTATTTTGTTTCTT 51912209 



Features flanking this part of subject sequence: 

680379 bp at 5' side: butyrylcholinesterase precursor 
554392 bp at 3' side: hypothetical protein 



Score =42.8 bits (46), Expect =2.2 
Identities = 26/28 (92%), Gaps = 0/28 (0%) 
Strand=Plus /Minus 

Query 2055 ATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 72724377 ATTGTTGTTGTTGTTGTTGTTGTTGATG 72724350 



Features flanking this part of subject sequence: 

51277 bp at 5' side: nuclear factor of kappa light polypeptide gene enhancer i... 
83898 bp at 3' side: hypothetical protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I M M III I I I I I I I I I I I I I I 
Sbjct 8124638 TTGTTGTTTTTGTTGTTGTTGTTGATG 8124664 



Features in this part of subject sequence: 
hypothetical protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
St rand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 20900949 TTGTTGTTATTGCTGTTGTTGTTGTTG 20900923 



Features flanking this part of subject sequence: 

1323998 bp at 5' side: hypothetical protein LOC131149 
371774 bp at 3' side: similar to hCG2021878 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 33/40 (82%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 180 TTCTCTCTTTACCCACCTCCGCCCCTGCGAGGAGTTGAGG 219 

I I I I I I I I I I I I I III III I I I I III I I I I I I I 
Sbjct 69040917 TTCTCTCTTTACACCACTCAGCCGCTGCCAGGGGTTGAGG 69040878 
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Features flanking this part of subject sequence: 

40760 bp at 5' side: leucine rich repeat containing 31 
8896 bp at 3' side: sterile alpha motif domain containing 7 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 31/37 (83%), Gaps = 0/37 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAAAAAAA 2092 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 76123541 TTGTTGTTATTGTTGTTGTTGTTGTTCTGAAAAATAA 76123505 



>ref I NW_001842405 . 1 I HsX_WGA1366_36 S Homo sapiens chromosome X genomic contig, alternate ass 

(based on HuRef SCAF_1103279188404) 

Length=3857110 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 3 



Score = 100 bits (110), Expect = 9e-18 
Identities = 154/216 (71%), Gaps = 8/216 (3%) 

Strand=Plus /Minus 

AAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT CATG 

II II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II 

AAACGGCCCATGAACGCCTTCATGGTATGGTCCCGCGGGCAGCGGCGCAAAATGGCCCTG 

GAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

GAGAAC CCCAAGATGCACAATTCTGAGATCAGCAAGCGCTTGGGCGCCGACTGGAAA 



Query 


528 




Sbjct 


3649310 


Query 


585 




Sbjct 


364? 


5250 


Query 


645 




Sbjct 


364? 


5193 


Query 


704 




Sbjct 


364? 


5134 



584 

3649251 
644 

3649194 

CTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCTCAA 7 03 
I I I I I I III I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
CTGCTGACCGACGCCGAGAAGCGACCATTCAT-CGACGAGGCCAAGCGACTTCGCGCCGT 3649135 

GCACATGGCTGACTACCCCGACTAC7VAGTACCGGCC 73 9 
I I I I I I I II II II I I I I I I I M M II I II 
GCACATGAAGGAGTATCCGGACTACAAGTACCGACC 3649099 



Features flanking this part of subject sequence: 

166860 bp at 5' side: zinc finger protein of the cerebellum 3 
893562 bp at 3' side: fibroblast growth factor 13 isoform lA 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2 0 48 ACCTGTGATTGTTGTTATTGATGTTGTTGTTG 2079 

I I I I I I I I I III II III I I I I I I I I I I I 
Sbjct 882950 ACCTGTGATAGTTTTTGTTGTTGTTGTTGTTG 882981 



>ref I NT_011786 . 15 I HsX_11943 S Homo sapiens chromosome X genomic contig, reference assembly 
Length=27718692 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 3 



Score = 100 bits (110), Expect = 9e-18 
Identities = 154/216 (71%), Gaps = 8/216 (3%) 
Strand=Plus /Minus 

Query 528 AAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT CATG 584 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II 

Sbjct 23798156 AAACGGCCCATGAACGCCTTCATGGTATGGTCCCGCGGGCAGCGGCGCAAAATGGCCCTG 23798097 

Query 585 GAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAAG 644 
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23798096 




645 


sbjit 


23798039 






Sbjct 


23797980 



CTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCTCAA 
I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CTGCTGACCGACGCCGAGAAGCGACCATTCAT-CGACGAGGCCAAGCGACTTCGCGCCGT 

GCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 
I I I I I I I II II II I I I I I I I I I I I I I I II 
GC AC ATGAAGGAGTATCCGGACTACAAGTACCGACC 23797945 



Features in this part of subject sequence: 
dedicator of cytokinesis 11 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 1149 GCCTCGGCCTCCGCCTCCTCGGCAGCC 1175 

I I I I I I I I I I I I I I I I I I I II I I I I 
Sbjct 1898025 GCCTCCGCCTCCGCCTCCTCCGCAGCC 1897999 



Features flanking this part of subject sequence: 
1062887 bp at 5' side: glutamate dehydrogenase 2 

1072247 bp at 3' side: glutamate receptor 3 isoform flop precursor 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 33/40 (82%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaaaaasi-iiaaaa 2095 

I I I I I I I I III I I I I I I I I I I I I II I I I I I III 
Sbjct 5457504 TTGTTGTTGTTGTTGTTGTTGTTGTTTTCAGAAAAACAAA 5457465 



Features flanking this part of subject sequence: 

168026 bp at 5 ' side: zinc finger protein of the cerebellum 3 
894725 bp at 3' side: fibroblast growth factor 13 isoform IB 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2048 ACCTGTGATTGTTGTTATTGATGTTGTTGTTG 2079 

I II II II II III M III I I I I I I I I I I I 
Sbjct 21031603 ACCTGTGATAGTTTTTGTTGTTGTTGTTGTTG 21031634 



>ref I NT_027140 . 6 I Hsl3_27300 il^ Homo sapiens chromosome 13 genomic contig, reference assembly 
Length=1821999 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 1 



Score = 96.9 bits (106), Expect = le-16 
Identities = 152/215 (70%), Gaps = 2/215 (0%) 
Strand=Plus/Plus 

TCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGG 
I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 

TCAAACGGCCCATGAACGCCTTCATGGTGTGGTCCCGCGGGCAGCGGCGCAAGATGGCCC 



TGCTCAAAGACAGCGACAAGATCCCTTTCATTCGA-GAGGCGGAGCGGCTGCGCCTCAAG 
I I II I I I I I I M I I I I I I I I II I I I I I I I I I I I I I I I 

TCATGTCCGAGGCCGAGAAGCGGCCGTTCA-TCGACGAGGCCAAGCGGCTGCGCGCGCTG 



Query 


526 


Sbjct 


218130 


Query 


586 


Sbjct 


218190 


Query 


646 


Sbjct 


218250 


Query 


705 
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Sbjct 218309 CACATGAAGGAGCACCCGGATTACAAGTACCGGCC 218343 
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Features flanking this part of subject sequence: 

269051 bp at 5' side: SRY (sex determining region Y)-box 1 
38475 bp at 3 ' side: hypothetical protein LOC122258 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 488205 TTGTTGTTGTTGATGTTGTTGTTGTTG 488231 



>ref I NW_001838745 . 1 1 Hs22_WGA1304_36 fe^ Homo sapiens chromosome 22 genomic contig, alternate a 

(based on HuRef SCAF_1103279188372 ) 

Length=21026802 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 10 



Score = 95.1 bits (104), Expect = 4e-16 
Identities = 94/122 (77%), Gaps = 0/122 (0%) 
Strand=Plus /Minus 



Query 


521 


Sbjct 


16472614 


Query 


581 


Sbjct 


16472554 


Query 


641 


Sbjct 


16472494 



GCACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I III I III I 
GCACGTCAAGCGGCCCATGAACGCCTTCATGGTGTGGGCTCAGGCAGCGCGCAGGAAGCT 

CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTG 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGCGGACCAGTACCCGCACCTGCACAACGCTGAGCTCAGCAAGACGCTGGGCAAGCTCTG 



4 GA 16472493 



Features in this part of subject sequence: 
SRY (sex determining region Y)-box 10 



Score = 77.0 bits (84), Expect = le-10 
Identities = 79/101 (78%), Gaps = 2/101 (1%) 
St rand=Plus /Minus 

Query 644 GCTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGAG-AGGCGGAGCGGCTGCGCCTCA 702 

I I I I I I II II II I I I I I I III I I I I I I I I I I I I I I I I I I I I I II I 
Sbjct 16467270 GCTGCTGAACGAAAGTGACAAGCGCCCCTTCAT-CGAGGAGGCTGAGCGGCTCCGTATGC 16467212 

Query 703 AGCACATGGCTGACTACCCCGACTACAAGTACCGGCCCAGG 743 

I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 16467211 AGCACAAGAAAGACCACCCGGACTACAAGTACCAGCCCAGG 16467171 



Features flanking this part of subject sequence: 
22174 bp at 5' side: like-glycosyltransf erase 
1289151 bp at 3' side: intestine-specific homeobox 



Score =42.8 bits (46), Expect =2.2 
Identities = 25/26 (96%), Gaps = 0/26 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGAT 2081 

II II II II II II I I I I I I I I I I I I I 
Sbjct 12263566 TTGTTGTTATTGTTGTTGTTGTTGAT 12263591 



Features in this part of subject sequence: 
calcineurin binding protein 1 



Score = 41.0 bits (44), Expect = 7.5 
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Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAA 2087 

I I I I I I I I III I I I I I I I I I I I III III 
Sbjct 2585440 TTGTTGTTGTTGTTGTTGTTGTTGTTGGTAAA 2585471 



>ref I NT_011520 . 11 I Hs22_11677 IS Homo sapiens chromosome 22 genomic contig, reference assembly 
Length=23276302 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 10 



Score = 95.1 bits (104), Expect = 4e-16 
Identities = 94/122 (77%), Gaps = 0/122 (0%) 

Strand=Plus /Minus 



Query 


521 


Sbjct 


17770001 


Query 


581 


Sbjct 


17769941 


Query 


641 


Sbjct 


17769881 



GCACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT 5 8 0 
II II I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I III I III I III I 

GCACGTCAAGCGGCCCATGAACGCCTTCATGGTGTGGGCTCAGGCAGCGCGCAGGAAGCT 17769942 

CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTG 6 4 0 
I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGCGGACCAGTACCCGCACCTGCACAACGCTGAGCTCAGCAAGACGCTGGGCAAGCTCTG 17769882 



Features in this part of subject sequence: 
SRY (sex determining region Y)-box 10 



Score = 77.0 bits (84), Expect = le-10 
Identities = 79/101 (78%), Gaps = 2/101 (1%) 
Strand=Plus /Minus 

Query 644 GCTGCTCAAAGACAGCGACAAGATCCCTTTCATTCGAG-AGGCGGAGCGGCTGCGCCTCA 702 

I I I I I I II II II I I I I I I III I I I I I I I I I I I I I I I I I I I I I II I 
Sbjct 17764657 GCTGCTGAACGAAAGTGACAAGCGCCCCTTCAT-CGAGGAGGCTGAGCGGCTCCGTATGC 17764599 

Query 703 AGCACATGGCTGACTACCCCGACTACAAGTACCGGCCCAGG 743 

I I I I I I I III MM I I I I I I I I I I I I I I I I I I I I 
Sbjct 17764598 AGCACAAGAAAGACCACCCGGACTACAAGTACCAGCCCAGG 17764558 



Features flanking this part of subject sequence: 
22175 bp at 5' side: like-glycosyltransf erase 
1283418 bp at 3 ' side: intestine-specific homeobox 



Score =42.8 bits (46), Expect =2.2 
Identities = 25/26 (96%), Gaps = 0/26 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGAT 2081 

I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 13570207 TTGTTGTTATTGTTGTTGTTGTTGAT 13570232 



Features in this part of subject sequence: 
calcineurin binding protein 1 



Score = 41.0 bits (44), Expect = 7.5 

Identities = 28/32 (87%), Gaps = 0/32 (0%) 

Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAA 2087 

II II II II III II II II II I M III III 
Sbjct 3900601 TTGTTGTTGTTGTTGTTGTTGTTGTTGGTAAA 3900632 



>ref I NW_001839122 . 2 I Hs8_WGA515_36 kt« Homo sapiens chromosome 8 genomic contig, alternate asse 
(based on HuRef SCAF 1103279188150) 
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Length=4000776 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

is part of subject sequence: 

Score = 87.8 bits (96), Expect = 6e-14 
Identities = 63/72 (87%), Gaps = 3/72 (4%) 
Strand=Plus/Plus 

Query 681 GAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC- 739 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1473842 GAGGCGGAGCGGCTGCGCCTGCAGCACATGCAGGACTACCCCAACTACAAGTACCGGCCG 1473901 

Query 740 — CAGGAAGAAG 749 

I I I I I I I I I I 

Sbjct 1473902 CGCAGGAAGAAG 1473913 



Score =53.6 bits (58), Expect = 0.001 
Identities = 73/102 (71%), Gaps = 0/102 (0%) 
Strand=Plus/Plus 

Query 530 GCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCA 589 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I 

Sbjct 1470162 GCGGCCCATGAACGCCTTCATGGTTTGGGCCAAGGACGAGAGGAAACGGCTGGCAGTGCA 1470221 

Query 590 GTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGG 631 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1470222 GAACCCGGACCTGCACAACGCCGAGCTCAGCAAGATGCTGGG 1470263 

>ref I NT_077531 . 3 I Hs8_77580 S Homo sapiens chromosome 8 genomic contig, reference assembly 
Length=4537293 



Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 



Score = 87.8 bits (96), Expect = 6e-14 
Identities = 63/72 (87%), Gaps = 3/72 (4%) 
Strand=Plus /Minus 

Query 681 GAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC- 739 

I I I I I I I I I I I I I I i I I I I i I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 3059477 GAGGCGGAGCGGCTGCGCCTGCAGCACATGCAGGACTACCCCAACTACAAGTACCGGCCG 3059418 



Query 740 — CAGGAAGAAG 7 49 

I I I I I I I I I I 

Sbjct 3059417 CGCAGGAAGAAG 3059406 



Score =53.6 bits (58), Expect = 0.001 
Identities = 73/102 (71%), Gaps = 0/102 (0%) 
Strand=Plus /Minus 



Query 530 GCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAGCA 5 89 

I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I II I I I I 

Sbjct 3063157 GCGGCCCATGAACGCCTTCATGGTTTGGGCCAAGGACGAGAGGAAACGGCTGGCAGTGCA 3063098 

Query 590 GTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGG 631 

I II III I I I I I I I I I I I I I I II I I I I I I I I I I 
Sbjct 3063097 GAACCCGGACCTGCACAACGCCGAGCTCAGCAAGATGCTGGG 3063056 

>ref I NW_001838339 . 2 I Hsl6_WGA937_36 l3 Homo sapiens chromosome 16 genomic contig, alternate as 
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(based on HuRef SCAF_1103279188181 ) 
Length=2624016 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 8 



Score =86.0 bits (94), Expect = 2e-13 
Identities = 83/107 (77%), Gaps = 0/107 (0%) 
Strand=Plus /Minus 

Query 528 AAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAG 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I II 
Sbjct 1665887 AAGCGGCCCATGAACGCATTCATGGTGTGGGCGCAGGCGGCGCGCCGCAAGCTGGCCGAC 

Query 588 CAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAA 634 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1665827 CAGTACCCGCACCTGCACAACGCCGAGCTCAGCAAGACGCTGGGCAA 1665781 



Features in this part of subject sequence: 
SRY (sex determining region Y)-box 8 



Score = 53.6 bits (58), Expect = 0.001 
Identities = 71/99 (71%), Gaps = 6/99 (6%) 
Strand=Plus /Minus 

Query 681 GAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTAC 734 

I I I I I I I I I I II III I I I I I I I I III I I I I I I I I I II I I I I I I 
Sbjct 1664351 GAGGCAGAGCGCCTTCGCGTGCAGCACAAGAAGGACCACCCCGACTACAAGTACCAGCCA 1664292 

Query 735 CGGCCCAGGAAGAAGGTGAAGTCCGGCAACGCCAACTCC 773 

I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Sbjct 1664291 CGGCGCAGGAAGAGCGCCAAAGCCGGCCACAGCGACTCC 1664253 



>ref I NT_037887 . 4 I Hsl6_37891 Homo sapiens chromosome 16 genomic contig, reference assembly 
Length=8576922 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY (sex determining region Y)-box 8 



Score = 86.0 bits (94), Expect = 2e-13 
Identities = 83/107 (77%), Gaps = 0/107 (0%) 
Strand=Plus/Plus 

Query 528 AAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGGAG 58 7 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I II 
Sbjct 972230 AAGCGGCCCATGAACGCATTCATGGTGTGGGCGCAGGCGGCGCGCCGCAAGCTGGCCGAC 97228 

Query 588 CAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAA 634 

I I I I II II I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Sbjct 972290 CAGTACCCGCACCTGCACAACGCCGAGCTCAGCAAGACGCTGGGCAA 972336 



Features in this part of subject sequence: 
SRY (sex determining region Y)-box 8 



Score = 53.6 bits (58), Expect = 0.001 
Identities = 71/99 (71%), Gaps = 6/99 (6%) 
Strand=Plus/Plus 

Query 6 81 GAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTAC 

I I I I I I I I I I II III I I I I I II I Ml I I I I I I I I I I I I I I I I I 
Sbjct 973766 GAGGCAGAGCGCCTTCGCGTGCAGCACAAGAAGGACCACCCCGACTACAAGTACCAGCCA 

Query 735 CGGCCCAGGAAGAAGGTGAAGTCCGGCAACGCCAACTCC 773 

I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Sbjct 973826 CGGCGCAGGAAGAGCGCCAAAGCCGGCCACAGCGACTCC 973864 
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>ref I NW_001838533 . 2 I Hsl_WGA129_36 ifi Homo sapiens chromosome 1 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188157) 

Length=37103761 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 



Score = 77.0 bits (84), Expect = le-10 
Identities = 73/91 (80%), Gaps = 2/91 (2%) 
Strand=Plus /Minus 

Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II II 
Sbjct 1847108 CACATCAAGAGGCCCATGAACGCCTTCATGGTGTGGGCCAAGGATGAGCGGAGGAAGATC 1847049 

Query 582 ATGGA-GCAGTCGCCCGACATGCACAACGCC 611 

II I II II II I I II II II II II II 
Sbjct 1847048 CTGCAAGCCTTC-CCAGACATGCACAACTCC 1847019 

Features in this part of subject sequence: 

protein phosphatase 1, regulatory (inhibitor) subunit 12B. . . 
protein phosphatase 1, regulatory (inhibitor) subunit 12B. . . 

Score =46.4 bits (50), Expect =0.18 
Identities = 34/40 (85%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaaaaaa>::aaaa 2095 

I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 3618816 TTGTTGTTGTTGTTGTTGTTGTTGTTTAAAAAAAAAAAAA 3618777 

Features in this part of subject sequence: 

protein phosphatase 1, regulatory (inhibitor) subunit 12B. . . 
protein phosphatase 1, regulatory (inhibitor) subunit 12B. . . 

Score = 46.4 bits (50), Expect = 0.18 
Identities = 34/40 (85%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaaa-u-::.-:i05aaaa 2095 

I I I I I I I I III I I I I I I I I I I I II I I I I I I I I I I 
Sbjct 3618819 TTGTTGTTGTTGTTGTTGTTGTTGTTGTTTAAAAAAAAAA 3618780 

Features flanking this part of subject sequence: 

76699 bp at 5' side: chromosome 1 open reading frame 21 
327829 bp at 3' side: hypothetical protein LOC116461 



Score =42.8 bits (46), Expect =2.2 
Identities = 26/28 (92%), Gaps = 0/28 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGG 2083 

I I I I I I I I III I I I I I I I I I I I I I I I 
Sbjct 21499677 TTGTTGTTGTTGTTGTTGTTGTTGATGG 21499650 

Features flanking this part of subject sequence: 

1296333 bp at 5' side: regulator of G-protein signalling 18 
409014 bp at 3' side: family with sequence similarity 5, member C 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I II I I I I I I I I I II 
Sbjct 15037624 TTGTTGTTATTGTTGTTGTTGTTGTTG 15037598 

Features in this part of subject sequence: 
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hypothetical protein 
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Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2423 CCt.ttttttCCTCCCTCTTTTCCCCTT 2449 

II I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 28353580 CCATTTTTTCCTGCCTCTTTTCCCCTT 28353606 



Features in this part of subject sequence: 

astrotactin isoform 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 22/22 (100%), Gaps = 0/22 (0%) 
Strand=Plus/Plus 

Query 2290 GCACCCCTccccccc;: v.ttt 2311 

I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 28766569 GCACCCCTCCCCCCCTTTTTTT 28766590 



>ref |NW_001838085.1|Hsl3_WGA794_36 Homo sapiens chromosome 13 genomic contig, alternate as 

(based on HuRef SCAF_1103279188104) 

Length=1807819 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 

287495 bp at 3 ' side: hypothetical protein LOC122258 



Score = 77.0 bits (84), Expect = le-10 
Identities = 80/105 (76%), Gaps = 0/105 (0%) 
Strand=Plus/Plus 

Query 526 TCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGG 585 

I I I I II I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 

Sbjct 226768 TCAAACGGCCCATGAACGCCTTCATGGTGTGGTCCCGCGGGCAGCGGCGCAAGATGGCCC 226827 

Query 586 AGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGG 630 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 226828 AGGAGAACCCCAAGATGCACAACTCGGAGATCAGCAAGCGCCTGG 226872 



Features flanking this part of subject sequence: 

39257 bp at 3' side: hypothetical protein LOC122258 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 475084 TTGTTGTTGTTGATGTTGTTGTTGTTG 475110 



>ref I NT_004487 . 18 I Hsl_4644 S Homo sapiens chromosome 1 genomic contig, reference assembly 
Length=56413061 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY-box 13 



Score = 77.0 bits (84), Expect = le-10 
Identities = 73/91 (80%), Gaps = 2/91 (2%) 

Strand=Plus/Plus 

Query 52 2 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 

I I I I I I I I I I I M M II II II I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Sbjct 54583178 CACATCAAGAGGCCCATGAACGCCTTCATGGTGTGGGCCAAGGATGAGCGGAGGAAGATC 54583237 
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Query 582 ATG-GAGCAGTCGCCCGACATGCACAACGCC 611 

II III I I I I I I I I I I I I I I I I I I 
Sbjct 54583238 CTGCAAGCCTTC-CCAGACATGCACAACTCC 54583267 

Features in this part of subject sequence: 

protein phosphatase 1, regulatory (inhibitor) subunit 12B. . . 
protein phosphatase 1, regulatory (inhibitor) subunit 12B. . . 

Score =46.4 bits (50), Expect =0.18 
Identities = 34/40 (85%), Gaps = 0/40 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaaaaaaa-saaa 2095 

I I I I I I I I III I I I I I I I I I I I II I I I I I I I I I I 
Sbjct 52810695 TTGTTGTTGTTGTTGTTGTTGTTGTTGTTTAAAAAAAAAA 52810734 

Features in this part of subject sequence: 

protein phosphatase 1, regulatory (inhibitor) subunit 12B. . . 
protein phosphatase 1, regulatory (inhibitor) subunit 12B. . . 

Score = 46.4 bits (50), Expect = 0.18 
Identities = 34/40 (85%), Gaps = 0/40 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaa:i-;=.:;aoaaaA 2095 

I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 52810698 TTGTTGTTGTTGTTGTTGTTGTTGTTTAAAAAAAAAAAAA 52810737 

Features flanking this part of subject sequence: 

327837 bp at 5' side: hypothetical protein LOC116461 
76799 bp at 3' side: chromosome 1 open reading frame 21 

Score =42.8 bits (46), Expect =2.2 
Identities = 26/28 (92%), Gaps = 0/28 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGG 2083 

I I I I I I I I III I I I I I I I I I I I I I I I 
Sbjct 34860172 TTGTTGTTGTTGTTGTTGTTGTTGATGG 34860199 

Features in this part of subject sequence: 
astrotactin isoform 1 
astrotactin isoform 2 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 22/22 (100%), Gaps = 0/22 (0%) 
Strand=Plus /Minus 

Query 2290 GCACCCCTccccccc vr.tttt ■: 2311 

I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 27602408 GCACCCCTCCCCCCCTTTTTTT 27602387 

Features in this part of subject sequence: 
hypothetical protein 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
St rand=Plus /Minus 

Query 2423 CCtttt tttCCTCCCTCTTTTCCCCTT 2449 

II I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 28014783 CCATTTTTTCCTGCCTCTTTTCCCCTT 28014757 

Features flanking this part of subject sequence: 

409150 bp at 5' side: family with sequence similarity 5, member C 
1294572 bp at 3' side: regulator of G-protein signalling 18 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 

Strand=Plus/Plus 
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Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 41323524 TTGTTGTTATTGTTGTTGTTGTTGTTG 41323550 



Features flanking this part of subject sequence: 

409171 bp at 5' side: family with sequence similarity 5, member C 
1294551 bp at 3' side: regulator of G-protein signalling 18 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 41323545 TTGTTGTTATTGTTGTTGTTGTTGTTG 41323571 



>ref I NW_001839132 . 1 I Hs8_WGA525_36 S Homo sapiens chromosome 8 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188282 ) 

Length=30141836 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY-box 17 



Score =66.2 bits (72), Expect = 2e-07 
Identities = 50/59 (84%), Gaps = 0/59 (0%) 
Strand=Plus/Plus 

Query 681 GAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 

II I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I I I I 
Sbjct 7238558 GAGGCAGAGCGGCTGCGCGTGCAGCACATGCAGGACCACCCCAACTACAAGTACCGGCC 7238616 



Features in this part of subject sequence: 
SRY-box 17 



Score = 55.4 bits (60), Expect = 3e-04 
Identities = 76/105 (72%), Gaps = 6/105 (5%) 
Strand=Plus/Plus 

Query 530 GCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGG A 

I I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I I III I 

Sbjct 7237795 GCGGCCGATGAACGCTTTCATGGTGTGGGCTAAGGACGA GCGCAAGCGGCTGGCGCA 

Query 587 GCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGG 631 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 7237852 GCAGAATCCAGACCTGCACAACGCCGAGTTGAGCAAGATGCTGGG 7237896 



Features in this part of subject sequence: 
hypothetical protein LOC23514 



Score =41.0 bits (44), Expect =7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2 056 TTGTTGTTATTGATGTTGTTGTTGATG 2 082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 497262 TTGTTGTTATTGTTGTTGTTGTTGTTG 497288 



Features in this part of subject sequence: 
hypothetical protein LOC23514 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 30/35 (85%), Gaps = 0/35 (0%) 
Strand=Plus/Plus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATGGCA 2085 

III I I I I I I I I I III I I I I I I I I I I I II II 
Sbjct 497266 TGTTATTGTTGTTGTTGTTGTTGTTGTTGTTGACA 497300 
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Features flanking this part of subject sequence: 

63747 bp at 5' side: ganglioside-induced differentiation-associated protein 1 ... 
181279 bp at 3' side: similar to hCG1799828 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 27230124 TTGTTGTTATTGTTGTTGTTGTTGTTG 27230098 



Features flanking this part of subject sequence: 

473228 bp at 5' side: cysteine-r ich secretory protein LCCL domain containing 1 
33133 bp at 3' side: hepatocyte nuclear factor 4, gamma 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 28305493 TTGTTGTTGTTGTTGTTGTTGTTGATG 28305467 



>ref I NT_008183 . 18 I Hs8_8340 Homo sapiens chromosome 8 genomic contig, reference assembly 
Length=38454502 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
SRY-box 17 



Score =66.2 bits (72), Expect = 2e-07 
Identities = 50/59 (84%), Gaps = 0/59 (0%) 
Strand=Plus/Plus 

Query 681 GAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 

I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I II II 
Sbjct 7225020 GAGGCAGAGCGGCTGCGCGTGCAGCACATGCAGGACCACCCCAACTACAAGTACCGGCC 7225078 



Features in this part of subject sequence: 
SRY-box 17 



Score = 55.4 bits (60), Expect = 3e-04 
Identities = 76/105 (72%), Gaps = 6/105 (5%) 
Strand=Plus/Plus 

Query 53 0 GCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATCATGG A 586 

I II II II II II II I I I I I I I I I I I I I II III M I I I I I III I 
Sbjct 7224257 GCGGCCGATGAACGCTTTCATGGTGTGGGCTAAGGACGA GCGCAAGCGGCTGGCGCA 7224313 

Query 58 7 GCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGG 631 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 7224314 GCAGAATCCAGACCTGCACAACGCCGAGTTGAGCAAGATGCTGGG 7224358 



Features flanking this part of subject sequence: 

63856 bp at 5' side: ganglioside-induced differentiation-associated protein 1 ... 
184610 bp at 3' side: similar to hCG1799828 isoform 1 



Score =42.8 bits (46), Expect =2.2 

Identities = 31/36 (86%), Gaps = 0/36 (0%) 

Strand=Plus /Minus 

Query 2 051 TGTGATTGTTGTTATTGATGTTGTTGTTGATGGCAA 2 0 86 

III I I I I I I I I I III I I I I I I I I I I I I I I I I 
Sbjct 27193847 TGTTATTGTTGTTGTTGTTGTTGTTGTTGTTAGCAA 27193812 



Features in this part of subject sequence: 
hypothetical protein LOC23514 
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Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 488931 TTGTTGTTATTGTTGTTGTTGTTGTTG 488957 



Features in this part of subject sequence: 
hypothetical protein LOC23514 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 30/35 (85%), Gaps = 0/35 (0%) 
Strand=Plus/Plus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATGGCA 2085 
III I I I I I I I I I III I I I I I I I I I I I II II 

Sbjct 488935 TGTTATTGTTGTTGTTGTTGTTGTTGTTGTTGACA 488969 



Features flanking this part of subject sequence: 

63869 bp at 5' side: ganglioside-induced differentiation-associated protein 1 ... 
184606 bp at 3 ' side: similar to hCG1799828 isoform 1 



Score =41.0 bits (44), Expect =7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 27193851 TTGTTGTTATTGTTGTTGTTGTTGTTG 27193825 



Features flanking this part of subject sequence: 

474603 bp at 5' side: cysteine-rich secretory protein LCCL domain containing 1 
33122 bp at 3 ' side: hepatocyte nuclear factor 4, gamma 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
St rand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 28272460 TTGTTGTTGTTGTTGTTGTTGTTGATG 28272434 



>ref |NW_001842422 . 1 I HsY_WGA1383_36 Homo sapiens chromosome Y genomic contig, alternate ass 

(based on HuRef SCAF_1103279188414) 

Length=3942718 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

sex determining region Y 



Score = 64.4 bits (70), Expect = 7e-07 
Identities = 94/126 (74%), Gaps = 8/126 (6%) 



Strand=Plus /Minus 




Query 


528 


AAGCGACCCATGAACGCCTTCATGGTGTGGTCGC-AGATCGAGCGGCGCAAGAT CAT 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II II II II 1 
AAGCGACCCATGAACGCATTCATCGTGTGGTCTCGCGATC-AGAGGCGCAAGATGGCTCT 


583 


Sbjct 


8468 


8410 


Query 
Sbjct 


584 
8409 


GGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAA 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 
AGAG-AATC-CCAGA-ATGCGAAACTCAGAGATCAGCAAGCAGCTGGGATACCAGTGGAA 


643 
8353 


Query 


644 


GCTGCT 649 




Sbjct 


8352 


1 1 1 1 
AATGCT 8347 





Features flanking this part of subject sequence: 
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770513 bp at 5' side: TGFB-induced factor homeobox 2-like, Y-linked 
776921 bp at 3' side: hypothetical protein 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1149 GCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCA 1187 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 1579134 GCCTCGGCCTCTGCCTCCACCTCAGCCTCTGCCTCCACA 1579096 



>ref I NT_011896 . 9 I HsY_12053 ifi Homo sapiens chromosome Y genomic contig, reference assembly 
Length=6265435 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

sex determining region Y 



Score = 64.4 bits (70), Expect = 7e-07 
Identities = 94/126 (74%), Gaps = 8/126 (6%) 



Strand=Plus /Minus 




Query 


528 


AAGCGACCCATGAACGCCTTCATGGTGTGGTCGC-AGATCGAGCGGCGCAAGAT CAT 


583 


Sbjct 


5944 


AAGCGACCCATGAACGCATTCATCGTGTGGTCTCGCGATC-AGAGGCGCAAGATGGCTCT 


5886 


Query 
Sbjct 


584 
5885 


GGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGGCAAACGCTGGAA 
1 II 1 II II 1 1 1 1 1 1 1 II 1 II II II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 
AGAG-AATC-CCAGA-ATGCGAAACTCAGAGATCAGCAAGCAGCTGGGATACCAGTGGAA 


643 
5829 


Query 


644 


GCTGCT 649 




Sbjct 


5828 


1 1 1 1 
AATGCT 5823 





Features flanking this part of subject sequence: 

675817 bp at 5' side: TGFB-induced factor homeobox 2-like, Y-linked 
777040 bp at 3' side: protocadherin 11 Y-linked isoform a 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus /Minus 

Query 1149 GCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCA 1187 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 1474178 GCCTCGGCCTCTGCCTCCACCTCAGCCTCTGCCTCCACA 1474140 



Features in this part of subject sequence: 
transducin beta-like lY 
transducin beta-like lY 



Score = 42.8 bits (46), Expect = 2.2 
Identities = 31/36 (86%), Gaps = 0/36 (0%) 
Strand=Plus/Plus 

Query 2047 GACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 4284860 GAACTGTTTTTGTTGTTATTGTTGTTGTTGTTGTTG 4284895 



Features in this part of subject sequence: 
transducin beta-like lY 
transducin beta-like lY 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

III I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 4284873 TGTTATTGTTGTTGTTGTTGTTGTTGTTGTTG 4284904 
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>ref I NT_011333 . 5 I Hs20_11490 Homo sapiens chromosome 20 genomic contig, reference assembly 
Length=1702150 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
hypothetical protein 
SRY-box 18 

Score = 62.6 bits (68), Expect = 2e-06 
Identities = 83/114 (72%), Gaps = 6/114 (5%) 
Strand=Plus /Minus 

Query 521 GCACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGAT 580 

II I I I I III I I I I I I I I I 1 II I I I I I I I II I I I I II III I I I I I I I 
Sbjct 1417251 GCGCATCCGGCGGCCCATGAACGCCTTCATGGTGTGGGCAAAGGACGA GCGCAAGCG 1417195 

Query 581 CATGG AGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAGCGGCTGGG 631 

III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1417194 GCTGGCTCAGCAGAACCCGGACCTGCACAACGCGGTGCTCAGCAAGATGCTGGG 1417141 

Features in this part of subject sequence: 
hypothetical protein 
SRY-box 18 

Score = 53.6 bits (58), Expect = 0.001 
Identities = 47/59 (79%), Gaps = 0/59 (0%) 
Strand=Plus /Minus 

Query 681 GAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 

II II II I I I I I I I I I I I I I I I II III I I I I I I I I II I I I I I I I I I I I 
Sbjct 1416 895 GAAGCCGAACGGCTGCGCGTGCAGCACTTGCGCGACCACCCCAACTACAAGTACCGGCC 1416 83 7 

>ref I NW_001838052 . 1 I Hsl2_WGA761_36 S Homo sapiens chromosome 12 genomic contig, alternate as 

(based on HuRef SCAF_1103279188408 ) 

Length=21675488 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

SRY (sex determining region Y)-box 5 isoform c 
SRY (sex determining region Y)-box 5 isoform b 

Score = 57.2 bits (62), Expect = le-04 
Identities = 76/103 (73%), Gaps = 2/103 (1%) 
Strand=Plus /Minus 

Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTC-GCAGATCGAGCGGCGCAAGAT 580 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I I I 

Sbjct 14103106 CACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGAT-GAACGGAGAAAGAT 14103048 

Query 581 CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAG 623 

II I I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 14103047 CCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAG 14103005 

Features flanking this part of subject sequence: 

15642 bp at 5' side: hypothetical protein LOC440087 
585 bp at 3' side: hypothetical protein LOC144608 

Score =42.8 bits (46), Expect =2.2 
Identities = 29/33 (87%), Gaps = 0/33 (0%) 
Strand=Plus /Minus 

Query 2050 CTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 5384229 CTGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 5384197 

Features flanking this part of subject sequence: 

70138 bp at 5' side: pleckstrin homology domain containing, family A member 5 
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463 bp at 3' side: AE binding protein 2 
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Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus /Minus 

Query 1144 CCAGCGCCTCGGCCTCCGCCTCCTC 1168 

III I I I I I I I I I I I I I I I I I I I I I 
Sbjct 10001352 CCACCGCCTCGGCCTCCGCCTCCTC 10001328 



>ref I NT_009714 . 16 I Hsl2_9871 18 Homo sapiens chromosome 12 genomic contig, reference assembly 

Length=27615668 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

SRY (sex determining region Y)-box 5 isoform a 
SRY (sex determining region Y)-box 5 isoform b 



Score = 57.2 bits (62), Expect = le-04 
Identities = 76/103 (73%), Gaps = 2/103 (1%) 
Strand=Plus /Minus 

Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTC-GCAGATCGAGCGGCGCAAGAT 580 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II III I I I I II 

Sbjct 16455227 CACATAAAGCGTCCAATGAATGCCTTCATGGTGTGGGCTAAAGAT-GAACGGAGAAAGAT 16455169 

Query 581 CATGGAGCAGTCGCCCGACATGCACAACGCCGAGATCTCCAAG 623 

II I I M I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 16455168 CCTTCAAGCCTTTCCTGACATGCACAACTCCAACATCAGCAAG 16455126 



Features flanking this part of subject sequence: 

15639 bp at 5' side: hypothetical protein LOC440087 
585 bp at 3' side: hypothetical protein LOC144608 



Score =42.8 bits (46), Expect =2.2 
Identities = 29/33 (87%), Gaps = 0/33 (0%) 
Strand=Plus /Minus 

Query 2050 CTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 7734259 CTGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 7734227 



Features in this part of subject sequence: 
pregnancy-zone protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I M M II I I M I M M I I I II 
Sbjct 2067054 TTGTTGTTATTGTTGTTGTTGTTGTTG 2067080 



Features flanking this part of subject sequence: 

70051 bp at 5' side: pleckstrin homology domain containing, family A member 5 
462 bp at 3 ' side: AE binding protein 2 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus /Minus 

Query 1144 CCAGCGCCTCGGCCTCCGCCTCCTC 1168 

III I I I I I I I I I I I I I I I I I I I I I 
Sbjct 12351770 CCACCGCCTCGGCCTCCGCCTCCTC 12351746 



>ref I NW_001838671 . 1 1 Hs20_WGA1230_36 1^ Homo sapiens chromosome 20 genomic contig, alternate a 
(based on HuRef SCAF_1103279188118 ) 

Length=1477768 
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Features in this part of subject sequence: 

SRY-box 18 



Score = 53.6 bits (58), Expect = 0.001 
Identities = 47/59 (79%), Gaps = 0/59 (0%) 
Strand=Plus /Minus 

Query 681 GAGGCGGAGCGGCTGCGCCTCAAGCACATGGCTGACTACCCCGACTACAAGTACCGGCC 739 

II II II I I I I I I I I I I I I I I I II III I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1435018 GAAGCCGAACGGCTGCGCGTGCAGCACTTGCGCGACCACCCCAACTACAAGTACCGGCC 1434960 



>ref I NW_001838954 . 2 I Hs5_WGA347_36 Homo sapiens chromosome 5 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188298 ) 

Length=20179980 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 
78951 bp at 5' side: stanniocalcin 2 precursor 
3698 bp at 3 ' side: NK2 transcription factor related, locus 5 



Score =51.8 bits (56), Expect = 0.004 
Identities = 42/50 (84%), Gaps = 2/50 (4%) 
Strand=Plus /Minus 

Query 1137 CGGACTCCCAGCGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I II II II II II II II II II I I I I I I I I I I I I I I I 
Sbjct 2701243 CGGACTCCCGGC — CTCGGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 2701196 



Features in this part of subject sequence: 
slit homolog 3 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 32/36 (88%), Gaps = 1/36 (2%) 
Strand=Plus/Plus 

Query 2423 CC" ",tttt; ■:CCTCCCTCTTTTCC-CCTTGCCCCCTC 2457 

I I I I I I I I I I I I I I I I I II III II I I I II I I I 
Sbjct 7061862 CCTTTTTTTCCTCCCTCTTATCCTCCCTTCCCCCTC 7061897 



Features flanking this part of subject sequence: 
555459 bp at 5' side: msh homeobox 2 
60915 bp at 3' side: HMP19 protein 



Score =42.8 bits (46), Expect =2.2 
Identities = 32/38 (84%), Gaps = 0/38 (0%) 
Strand=Plus /Minus 

Query 1999 CTCTGCCCAGCCGGAGGGACGCGGAGGAGGAAGAGGGT 2036 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1770755 CTCTGCACAGCCGGACAGGTGAGGAGGAGGAAGAGGGT 1770718 



Features in this part of subject sequence: 

SRY (sex determining region Y)-box 30 isoform a 
SRY (sex determining region Y)-box 30 isoform b 



Score = 41.0 bits (44), Expect = 7.5 

Identities = 27/30 (90%), Gaps = 0/30 (0%) 

Strand=Plus/Plus 

Query 528 AAGCGACCCATGAACGCCTTCATGGTGTGG 557 

I I I I I I I I I I I I I I I I I II I I I II II 
Sbjct 18297850 AAGCGACCCATGAACGCATTTATGGTTTGG 18297879 



>ref I NT_011295 . 10 I Hsl9_11452 1.^ Homo sapiens chromosome 19 genomic contig, reference assembly 
Length=15825424 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 
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Features flanking this part of subject sequence: 
1925 bp at 5' side: TRMl tRNA methyltransf erase 1 
16847 bp at 3' side: transcriptional repressor NACl 



Score =51.8 bits (56), Expect = 0.004 
Identities = 34/38 (89%), Gaps = 0/38 (0%) 
Strand=Plus /Minus 

Query 1149 GCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Sbjct 4491977 GCCTCGGCCTCCGCCTCCGCGGCCTCCGCGGCCTCCGC 4491940 



Features in this part of subject sequence: 
calcium channel, alpha lA subunit isoform 2 
calcium channel, alpha lA subunit isoform 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGA 2080 

I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 4698340 TTGTTGTTGTTGATGTTGTTGTTGA 4698316 



>ref I NT_023133 . 12 I Hs5_23289 Homo sapiens chromosome 5 genomic contig, reference assembly 
Length=25714846 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 

3698 bp at 5' side: NK2 transcription factor related, locus 5 
79019 bp at 3 ' side: stanniocalcin 2 precursor 



Score = 51.8 bits (56), Expect = 0.004 
Identities = 42/50 (84%), Gaps = 2/50 (4%) 
Strand=Plus/Plus 

Query 1137 CGGACTCCCAGCGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGC 1186 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 17475370 CGGACTCCCGGC— CTCGGCCTCCGCCTCCGCCTCCGCCTCCGCCTCCGC 17475417 



Features in this part of subject sequence: 
slit homolog 3 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 32/36 (88%), Gaps = 1/36 (2%) 
Strand=Plus /Minus 

Query 2423 CCtfctttttCCTCCCTCTTTTCC-CCTTGCCCCCTC 2457 

I I I I I I I I I I I I I I I I I I I III II I I I I I I I I 
Sbjct 13112345 CCTTTTTTTCCTCCCTCTTATCCTCCCTTCCCCCTC 13112310 



Features flanking this part of subject sequence: 
60329 bp at 5' side: HMP19 protein 
556789 bp at 3' side: msh homeobox 2 



Score =42.8 bits (46), Expect =2.2 
Identities = 32/38 (84%), Gaps = 0/38 (0%) 
Strand=Plus/Plus 

Query 1999 CTCTGCCCAGCCGGAGGGACGCGGAGGAGGAAGAGGGT 2036 

II II II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 18404423 CTCTGCACAGCCGGACAGGTGAGGAGGAGGAAGAGGGT 18404460 



Features in this part of subject sequence: 

SRY (sex determining region Y)-box 30 isoform a 
SRY (sex determining region Y)-box 30 isoform b 
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Score = 41.0 bits (44), Expect = 7.5 
Identities = 27/30 (90%), Gaps = 0/30 (0%) 
Strand=Plus /Minus 
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Query 528 AAGCGACCCATGAACGCCTTCATGGTGTGG 557 

I I I I I I I I I I I I I I I I I II I I I I I III 
Sbjct 1885418 AAGCGACCCATGAACGCATTTATGGTTTGG 1885389 



>ref I NW_001838022 . 2 I Hsll_WGA731_36 Iv^ Homo sapiens chromosome 11 genomic contig, alternate as 

(based on HuRef SCAF_1103279188392 ) 

Length=42354830 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

SRY (sex determining region Y)-box 6 isoform 1 
SRY (sex determining region Y)-box 6 isoform 2 



Score =50.0 bits (54), Expect = 0.014 
Identities = 65/90 (72%), Gaps = 0/90 (0%) 
Strand=Plus/Plus 

Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 

I I I I I I I I I I I I I I I I I I II I I I I I I I I III I II III II I II III 
Sbjct 34920132 CACATTAAGCGACCAATGAATGCATTCATGGTTTGGGCAAAGGATGAGAGGAGAAAAATC 34920191 

Query 582 ATGGAGCAGTCGCCCGACATGCACAACGCC 611 

I II I I I I I I I I I I I I I I I I I 
Sbjct 34920192 CTTCAGGCCTTCCCCGACATGCATAACTCC 34920221 



Features flanking this part of subject sequence: 

33 bp at 5' side: mitogen-activated protein kinase 8 interacting protein 1 
13517 bp at 3' side: cryptochrome 2 (photolyase-like ) 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 35/42 (83%), Gaps = 1/42 (2%) 
Strand=Plus /Minus 

Query 1155 GCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGCGCTCGCG 1196 

I II II II II II II M M I I I I I I I I I I I I I I I I I I 
Sbjct 5001886 GCCTCCGCCTCCTTCGCAGCCGCCGCCTCCTCCGCGC-CGCG 5001846 



Features flanking this part of subject sequence: 
1290185 bp at 5' side: mucin 15 
199821 bp at 3 ' side: leucine zipper protein 2 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 25626780 TTGTTGTTGTTGTTGTTGTTGTTGATG 25626754 



>ref I NW_001838468 . 1 I Hsl8_WGA1177_36 il^ Homo sapiens chromosome 18 genomic contig, alternate a 

(based on HuRef SCAF_1103279188038) 
Length=7515650 

Features in this part of subject sequence: 

similar to hCG1790759 



Score = 50.0 bits (54), Expect = 0.014 
Identities = 35/40 (87%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGC-::--::aaaaaa.-::aa 2095 

II I I I I I I III I I I I I I I I I I I II I I I I I I I I I I I 
Sbjct 1473112 TTGTTGTTGTTGTTGTTGTTGTTGTTGAAAAAAAAAAAAA 1473073 



>ref I NW_001838951 . 1 1 Hs5_WGA344_36 ^ Homo sapiens chromosome 5 genomic contig, alternate asse 
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(based on HuRef SCAF_1103279188395) 
Length=20933881 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 
699382 bp at 5' side: myocyte enhancer factor 2C 
378040 bp at 3' side: hypothetical protein 



Score = 50.0 bits (54), Expect = 0.014 
Identities = 35/40 (87%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGC-;:.;=o«3aaa&-;:.u 2095 

I I I I I I I I III I I I I I I I I I I I I I I II II II II II 
Sbjct 18105201 TTGTTGTTGTTGTTGTTGTTGTTGATAGAAACAAAAAAAA 18105162 



Features flanking this part of subject sequence: 

1347233 bp at 5' side: EGF-like repeats and discoidin I-like domains-containing ... 
881684 bp at 3' side: cytochrome c oxidase subunit VIIc precursor 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAAAAAAAAA 2094 

I I I I I I I I I I I I I I I I II II III I I I I I I I I I I 
Sbjct 14316178 TTGTTGTTATTGTTGTTCTTATTTATGTTAAAAAAAAAA 14316216 



>ref I NW_001838865 . 2 I Hs2_WGA258_36 S Homo sapiens chromosome 2 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188138 ) 

Length=7557636 

Features flanking this part of subject sequence: 

283252 bp at 5' side: hypothetical protein LOC79843 

116307 bp at 3' side: plasminogen activator inhibitor type 1, member 2 



Score = 50.0 bits (54), Expect = 0.014 
Identities = 35/40 (87%), Gaps = 0/40 (0%) 
Strand=Plus/Plus 

Query 2047 GACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGCAA 2 086 

II II II I I I I I I I I I III I I I I I I I I I I I I I I I I I 
Sbjct 1525080 GACCTGGGATTTTTGTTGTTGTTGTTGTTGTTGTTGGCAA 1525119 



>ref I NT_006576 . 15 I Hs5_6733 S Homo sapiens chromosome 5 genomic contig, reference assembly 
Length=46378398 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 
1500958 bp at 5' side: hypothetical protein 
319712 bp at 3' side: cadherin 18, type 2 preproprotein 



Score =50.0 bits (54), Expect = 0.014 
Identities = 27/27 (100%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 19126380 TTGTTGTTATTGATGTTGTTGTTGATG 19126354 



Features flanking this part of subject sequence: 
2372 bp at 5' side: hypothetical protein 
695213 bp at 3' side: hypothetical protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus /Minus 
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Query 2628 GGGGAGCTGGCGGCGGCGGCTGCTGGGCCTCC 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 1877085 GGGGAGCGGGCGGCGGCGGCTGCGGCGACTCC 



Features in this part of subject sequence: 
KPL2 protein isoform 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2052 GTGATTGTTGTTATTGATGTTGTTGTT 2078 

III I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 35740888 GTGGTTGTTGTTATTGTTGTTGTTGTT 35740914 



>ref I NT_009237 . 17 I Hsll_9394 S Homo sapiens chromosome 11 genomic contig, reference assembly 
Length=49571094 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

SRY (sex determining region Y)-box 6 isoform 2 
SRY (sex determining region Y)-box 6 isoform 1 



Score = 50.0 bits (54), Expect = 0.014 
Identities = 65/90 (72%), Gaps = 0/90 (0%) 
Strand=Plus /Minus 

Query 522 CACATCAAGCGACCCATGAACGCCTTCATGGTGTGGTCGCAGATCGAGCGGCGCAAGATC 581 

I I I I I I I I I I I I I II I I I II I I I I I I I I III I II III II I II III 
Sbjct 14797892 CACATTAAGCGACCAATGAATGCATTCATGGTTTGGGCAAAGGATGAGAGGAGAAAAATC 14797833 

Query 582 ATGGAGCAGTCGCCCGACATGCACAACGCC 611 

I II I i i i I I I I M I I I I I I I 
Sbjct 14797832 CTTCAGGCCTTCCCCGACATGCATAACTCC 14797803 



Features flanking this part of subject sequence: 
193583 bp at 5' side: leucine zipper protein 2 
1060010 bp at 3 ' side: transmembrane protein 16C 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2 056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 24081028 TTGTTGTTGTTGTTGTTGTTGTTGATG 24081054 



Features flanking this part of subject sequence: 

13517 bp at 5' side: cryptochrome 2 (photolyase-like ) 

33 bp at 3' side: mitogen-activated protein kinase 8 interacting protein 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 35/42 (83%), Gaps = 1/42 (2%) 

Strand=Plus/Plus 

Query 1155 GCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGCGCTCGCG 1196 

I II II II II II II I I I I I I I I I I I I I I 1 II I I I I I 
Sbjct 44694540 GCCTCCGCCTCCTTCGCAGCCGCCGCCTCCTCCGCGC-CGCG 44694580 



>ref I NT_007819 . 16 I Hs7_7976 ll^ Homo sapiens chromosome 7 genomic contig, reference assembly 
Length=47690382 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 

627203 bp at 5' side: Sp8 transcription factor isoform 2 
15247 bp at 3' side: Sp4 transcription factor 
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Score =50.0 bits (54), Expect = 0.014 
Identities = 35/40 (87%), Gaps = 0/40 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaa-u-aaaaaaaa 2095 

I I I I I I I I III I I I I I I I I I I I II I I I I I I I I I I I 
Sbjct 20941542 TTGTTGTTGTTGTTGTTGTTGTTGTTGTTAAAAAAAAAAA 20941581 



Features in this part of subject sequence: 
zinc finger/RING finger 2 



Score =44.6 bits (48), Expect =0.62 
Identities = 48/64 (75%), Gaps = 0/64 (0%) 
Strand=Plus/Plus 

Query 1143 CCCAGCGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGCGCTCGCGGCCCCG 1202 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 29814106 CCCAGCGCCTCCGGCGGCGCCGCGGCGGCCGCGGCGGCCCCGGCAGCCCCGGCGGCCCCG 29814165 

Query 1203 GGCA 1206 

I I I 

Sbjct 29814166 CGCA 29814169 



Features flanking this part of subject sequence: 

130830 bp at 5' side: T-box transcription factor TBX20 isoform B 
249206 bp at 3' side: HERPUD family member 2 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 30/34 (88%), Gaps = 0/34 (0%) 
Strand=Plus /Minus 

Query 2423 CCtttttttCCTCCCTCTTTTCCCCTTGCCCCCT 2456 

I I I I I I I I I I I I I I I I I I I I I I III I I I I I 
Sbjct 34913052 CCTTTTCTTCCTCCCTCTTTTCCTCTTCCTCCCT 34913019 



Features flanking this part of subject sequence: 

627200 bp at 5' side: Sp8 transcription factor isoform 2 
15248 bp at 3' side: Sp4 transcription factor 



Score =42.8 bits (46), Expect =2.2 
Identities = 35/42 (83%), Gaps = 2/42 (4%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG — GC-s-saaaaaaa-sa 2095 

I I I I I I I I III I I I I I I I I I I I II I I I I I I I I I I I 
Sbjct 20941539 TTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTAAAAAAAAAA 20941580 



Features in this part of subject sequence: 
transf ormer-2 alpha 



Score =42.8 bits (46), Expect =2.2 
Identities = 29/33 (87%), Gaps = 0/33 (0%) 
Strand=Plus/Plus 

Query 766 CCAACTCCAGCTCCTCGGCCGCCGCCTCCTCCA 798 

III Mill I I I I I I I I I I I I I I I I I I I I I 
Sbjct 23034796 CCACCTCCACCTCCACCGCCGCCGCCTCCTCCA 23034828 



Features in this part of subject sequence: 
sorting nexin 8 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 1158 TCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I II II 
Sbjct 1842936 TCCGCCTCCTCGTCAGCCTCCGCCTCAGCTGC 1842967 



Features flanking this part of subject sequence: 
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213509 bp at 5' side: serine/threonine kinase 31 isoform b 
239342 bp at 3 ' side: neuropeptide Y 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus /Minus 

Query 2055 ATTGTTGTTATTGATGTTGTTGTTG 2079 

I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 23574476 ATTGTTGTTATTGTTGTTGTTGTTG 23574452 



Features flanking this part of subject sequence: 

29183 bp at 5' side: ras related v-ral simian leukemia viral oncogene homolog A 
67365 bp at 3' side: hypothetical protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2053 TGATTGTTGTTATTGATGTTGTTGTTG 2079 

I I I I I I I I I I I III I I I I I I I I I I I 
Sbjct 39264011 TGATTGTTGTTGTTGTTGTTGTTGTTG 39263985 



>ref I NT_010966 . 13 I Hsl8_11123 S Homo sapiens chromosome 18 genomic contig, reference assembly 
Length=33548238 

Features in this part of subject sequence: 
similar to hCG1790759 



Score =50.0 bits (54), Expect = 0.014 
Identities = 35/40 (87%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaas:S:.:u;;3aaaa 2095 

I I I I I I I I III I I I I I I I I I I I II I I I I I I I I I I I 
Sbjct 27513661 TTGTTGTTGTTGTTGTTGTTGTTGTTGAAAAAAAAAAAAA 27513622 



>ref I NT_010393 . 15 I Hsl6_10550 S Homo sapiens chromosome 16 genomic contig, reference assembly 
Length=25336229 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 
508 bp at 5' side: ubiquitin specific protease 7 
128183 bp at 3' side: hypothetical protein LOC29035 



Score = 50.0 bits (54), Expect = 0.014 
Identities = 54/70 (77%), Gaps = 5/70 (7%) 
Strand=Plus/Plus 

Query 1142 TCCCAGCGCCTCGGCCTCCGCCTCCTCGGCAGCCTC GGCCTCC-GCAGCGCTCGCG 1196 

I I I I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I I I I I II II II 

Sbjct 370729 TCCGAGAGCCGCGGCCTCCGCCTCCTCGGCGTCGTCGTCGGGGCTCCGGCAGCGGACGCG 37078 

Query 1197 GCCCCGGGCA 1206 

Sbjct 370789 GCGCCCGGCA 370798 



Features in this part of subject sequence: 

poly (A) -specif ic ribonuclease (deadenylation nuclease) 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2052 GTGATTGTTGTTATTGATGTTGTTGTT 2078 

III I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 5859015 GTGGTTGTTGTTATTGTTGTTGTTGTT 5858989 



Features flanking this part of subject sequence: 
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71489 bp at 5' side: sodium channel, nonvoltage-gated 1, gamma 
61616 bp at 3' side: sodium channel, nonvoltage-gated 1, beta 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 14611358 TTGTTGTTGTTGATGTTGTTGTTGTTG 14611384 



>ref I NT_005403 . 16 1 Hs2_5560 ifi Homo sapiens chromosome 2 genomic contig, reference assembly 
Length=84213157 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 
127128 bp at 5' side: hypothetical protein 
282932 bp at 3 ' side: hypothetical protein LOC79843 



Score = 50.0 bits (54), Expect = 0.014 
Identities = 35/40 (87%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2 0 47 GACCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGCAA 2086 

I I I I I I I I I I I I I I I III I I I I I I I I I I I I I I I I I 
Sbjct 75191620 GACCTGGGATTTTTGTTGTTGTTGTTGTTGTTGTTGGCAA 75191581 



Features flanking this part of subject sequence: 

237306 bp at 5' side: UDP-N-acetyl-alpha-D-galactosamine : polypeptide N-acetylga. . . 
10897 bp at 3' side: potassium inwardly-rectifying channel J3 



Score =42.8 bits (46), Expect =2.2 
Identities = 23/23 (100%), Gaps = 0/23 (0%) 
Strand=Plus/Plus 

Query 2241 GTTTTGTTTTATTTTGCTTCTTG 2263 

II I I I I I I I I I I I I I I I I I I I I I 
Sbjct 5753787 GTTTTGTTTTATTTTGCTTCTTG 5753809 



Features in this part of subject sequence: 
homeobox Dll 



Score = 42.8 bits (46), Expect = 2.2 
Identities = 28/31 (90%), Gaps = 0/31 (0%) 
Strand=Plus /Minus 

Query 1159 CCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

II II II II II I I I III I I I I I I I I I I I I 
Sbjct 27181871 CCGCCTCCTCGGCCGCCGCGGCCGCCGCAGC 27181841 



Features in this part of subject sequence: 
titin isoform N2-A 
titin isoform novex-1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 29758383 TTGTTGTTGTTGATGTTGTTGTTGTTG 29758357 



>ref I NT_006713 . 14 I Hs5_6870 ^ Homo sapiens chromosome 5 genomic contig, reference assembly 
Length=42230486 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 
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Features flanking this part of subject sequence: 
699394 bp at 5' side: myocyte enhancer factor 2C 
377967 bp at 3' side: hypothetical protein 

Score =50.0 bits (54), Expect = 0.014 
Identities = 35/40 (87%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaaasi-aaaaaaa 2095 

I I I I I I I I III I I I I I I I I I I I I I I II I I I I I I I I 
Sbjct 39413396 TTGTTGTTGTTGTTGTTGTTGTTGATAGAAACAAAAAAAA 39413357 

Features flanking this part of subject sequence: 

1351749 bp at 5' side: EGF-like repeats and discoidin I-like domains-containing ... 
881894 bp at 3 ' side: cytochrome c oxidase subunit VIIc precursor 

Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/39 (84%), Gaps = 0/39 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAAAAAAAAA 2094 

I I I I I I I I I I I I I I I I II II III I I I I I I I I I I 
Sbjct 35626299 TTGTTGTTATTGTTGTTCTTATTTATGTTAAAAAAAAAA 35626337 

Features flanking this part of subject sequence: 

98714 bp at 5' side: interleukin 6 signal transducer isoform 2 precursor 
25158 bp at 3' side: ankyrin repeat domain 55 isoform 1 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 30/34 (88%), Gaps = 1/34 (2%) 
Strand=Plus /Minus 

Query 2049 CCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 5965211 CCTGTG-TTGTTGTTGTTGTTGTTGTTGTTGTTG 5965179 

>ref I NW_001838061 . 2 I Hsl2_WGA770_36 S Homo sapiens chromosome 12 genomic contig, alternate as 

(based on HuRef SCAF_1103279188362 ) 

Length=32873191 



Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 



Features flanking this part of subject sequence: 

63961 bp at 5' side: hypothetical protein LOC196477 
1181244 bp at 3' side: hypothetical protein 

Score =46.4 bits (50), Expect =0.18 
Identities = 34/40 (85%), Gaps = 0/40 (0%) 

Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaaaaaaasiaaa 2095 

I I I I I I I I III II II II II II I II I I I I I I I I I I 
Sbjct 18088785 TTGTTGTTGTTGTTGTTGTTGTTGTTGTTTAAAAAAAAAA 18088824 

Features flanking this part of subject sequence: 

4678 bp at 5' side: damage-regulated autophagy modulator 

42189 bp at 3 ' side: N-acetylglucosamine-l-phosphate transferase 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 7113244 TTGTTGTTATTGTTGTTGTTGTTGTTG 7113270 

Features flanking this part of subject sequence: 

15027 bp at 5' side: IKK interacting protein isoform 1 

9365 bp at 3' side: solute carrier family 25 member 3 isoform b precursor 
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Score =41.0 bits (44), Expect =7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 10374017 TGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 10374048 



Features flanking this part of subject sequence: 
11252 bp at 5' side: tetraspanin 19 

111109 bp at 3' side: solute carrier family 6, member 15 isoform 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Minus 

Query 2059 TTGTTATTGATGTTGTTGTTGATGGCA 2085 

I I I I I III I I I I I I I I I I I I I I I I I 

Sbjct 23985705 TTGTTGTTGTTGTTGTTGTTGATGGCA 23985679 



Features flanking this part of subject sequence: 

1244859 bp at 5' side: solute carrier family 6, member 15 isoform 2 

505592 bp at 3' side: transmembrane and tetratricopeptide repeat containing 2 



Score =41.0 bits (44), Expect =7.5 
Identities = 29/32 (90%), Gaps = 1/32 (3%) 
Strand=Plus /Minus 

Query 2049 CCTGT-GATTGTTGTTATTGATGTTGTTGTTG 2079 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 25350079 CCTGTAGTTTGTTGTTATTGTTGTTGTTGTTG 25350048 



Features flanking this part of subject sequence: 
60181 bp at 5' side: hypothetical protein 
36799 bp at 3 ' side: synaptotagmin I 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 27/30 (90%), Gaps = 0/30 (0%) 
St rand=Plus /Minus 

Query 2053 TGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

II I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 29504024 TGTTTGTTGTTGTTGTTGTTGTTGTTGATG 29503995 



>ref |NW_001838006 . 2 I HslO_WGA715. 36 S Homo sapiens chromosome 10 genomic contig, alternate as 

(based on HuRef SCAF_1103279188397) 

Length=24511762 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 

21875 bp at 5' side: VPSIO domain receptor protein SORCS 3 
164826 bp at 3' side: coiled-coil domain containing 147 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 28/30 (93%), Gaps = 0/30 (0%) 
Strand=Plus/Plus 

Query 2 050 CTGTGATTGTTGTTATTGATGTTGTTGTTG 2 079 

I I I I I I I I I I I I I I III I I I I I I I I I I I 
Sbjct 18620835 CTGTGATTGTTGTTGTTGTTGTTGTTGTTG 18620864 



Features in this part of subject sequence: 
attractin-like 1 



Score =42.8 bits (46), Expect =2.2 
Identities = 31/36 (86%), Gaps = 0/36 (0%) 
Strand=Plus/Plus 
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Query 2047 GACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I M I I I I I I I I III I I I I I I I I I I I II 
Sbjct 7796118 GACCTATGCTTGTTGTTGTTGTTGTTGTTGTTGTTG 7796153 



Features in this part of subject sequence: 

eukaryotic translation initiation factor 3, subunit 10 th. . . 



Score =41.0 bits (44), Expect =7.5 
Identities = 30/34 (88%), Gaps = 1/34 (2%) 
Strand=Plus/Plus 

Query 2049 CCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 4199496 CCTGTG-TTGTTGTTTTTGTTGTTGTTGTTGTTG 4199528 



>ref I NW_001838988 . 2 I Hs6_WGA381_36 S Homo sapiens chromosome 6 genomic contig, alternate asse 

(based on HuRef SCAF_1103279187031 ) 

Length=10950630 

Features flanking this part of subject sequence: 
5780 bp at 5' side: PR domain containing 13 
31753 bp at 3' side: cyclin C isoform a 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 30/32 (93%), Gaps = 1/32 (3%) 
Strand=Plus /Minus 

Query 2 423 CCtttt ■■ tttCCTCCCTCTTTTCCCCTTGCCC 2 453 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 6415009 CCTTTTATTTCTTCCCTCTTTTCCCCTTGCCC 6414978 



>ref I NT_025741 . 14 I Hs6_25897 il^ Homo sapiens chromosome 6 genomic contig, reference assembly 
Length=61645385 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 
39819 bp at 5' side: cyclin C isoform b 
5780 bp at 3 ' side: PR domain containing 13 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 30/32 (93%), Gaps = 1/32 (3%) 
Strand=Plus/Plus 

Query 2423 CCt t:.".t -tttCCTCCCTCTTTTCCCCTTGCCC 2453 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 4218557 CCTTTTATTTCTTCCCTCTTTTCCCCTTGCCC 4218588 



Features flanking this part of subject sequence: 

44704 bp at 5' side: cell division cycle 2-like 6 (CDK8-like) 

15240 bp at 3' side: S-adenosylmethionine decarboxylase 1 isoform 1 precursor 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 

Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I II I I II 
Sbjct 15350472 TTGTTGTTGTTGATGTTGTTGTTGTTG 15350498 



Features flanking this part of subject sequence: 
536408 bp at 5' side: connexin 43 

415307 bp at 3 ' side: heat shock transcription factor 2 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I III I I I I I I I I I I I I I I 
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Features flanking this part of subject sequence: 

422057 bp at 5' side: sterile alpha motif domain containing 5 
356529 bp at 3' side: SAM and SH3 domain containing 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2237 CTTGGTTTTGTTTTATTTTGCTTCTTG 2263 

III I I I I I I II II II II I I I I I III 
Sbjct 52412078 CTTTGTTTTGTTTTATTTTGCTTTTTG 52412104 



Features in this part of subject sequence: 
opioid receptor, mu 1 isoform MOR-10 
phosphoinositide-binding protein PIP3-E 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 29/32 (90%), Gaps = 1/32 (3%) 
St rand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAA 2087 

I I I I I I I I III I I I I I I I I I I I I I I I I I I 
Sbjct 58617258 TTGTTGTTGTTGTTGTTGTTGTTGA-GGCAAA 58617228 



>ref I NT_113898 . 1 1 Hs6_111617 Homo sapiens chromosome 6 genomic contig, reference assembly 
Length=1305230 

Features flanking this part of subject sequence: 
539 bp at 5' side: similar to HGC6 . 4 

105 bp at 3' side: similar to myeloid/lymphoid or mixed-lineage leukemia (tr... 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 31/35 (88%), Gaps = 0/35 (0%) 
Strand=Plus /Minus 

Query 1155 GCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I II II II II I I I I I I I I I I I I I I I II 
Sbjct 814424 GCCTCCGCCTCCGCGGCTGCCTCCGCCTCCGCGGC 814390 



>ref I NT_007302 . 13 I Hs6_7459 fe^ Homo sapiens chromosome 6 genomic contig, reference assembly 
Length=2236975 

Features flanking this part of subject sequence: 
539 bp at 5' side: hypothetical protein LOC653483 

105 bp at 3' side: myeloid/lymphoid or mixed-lineage leukemia (trithorax hom. . . 



Score = 46.4 bits (50), Expect = 0.18 
Identities = 31/35 (88%), Gaps = 0/35 (0%) 
Strand=Plus /Minus 

Query 1155 GCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 185635 GCCTCCGCCTCCGCGGCTGCCTCCGCCTCCGCGGC 185601 



>ref I NT_030059 . 12 I HslO_30314 S Homo sapiens chromosome 10 genomic contig, reference assembly 
Length=44617998 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 

164871 bp at 5' side: coiled-coil domain containing 147 
21898 bp at 3' side: VPSIO domain receptor protein SOROS 3 



Score =46.4 bits (50), Expect =0.18 
Identities = 28/30 (93%), Gaps = 0/30 (0%) 
Strand=Plus /Minus 
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Query 2050 CTGTGATTGTTGTTATTGATGTTGTTGTTG 2079 

I I I I I M I I I I I I I III I I I I I I I I I I I 
Sbjct 25127714 CTGTGATTGTTGTTGTTGTTGTTGTTGTTG 25127685 



Score =42.8 bits (46), Expect =2.2 
Identities = 31/36 (86%), Gaps = 0/36 (0%) 
Strand=Plus /Minus 

Query 2047 GACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I II I I I I I I I I III I I I I I I I I I I I II 
Sbjct 35956938 GACCTATGCTTGTTGTTGTTGTTGTTGTTGTTGTTG 35956903 

Features flanking this part of subject sequence: 

48094 bp at 5' side: protein phosphatase 1, regulatory (inhibitor) subunit 30 
117589 bp at 3' side: tankyrase, TRFl-interacting ankyrin-related ADP-ribose P' 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 33/40 (82%), Gaps = 0/40 (0%) 
Strand=Plus/Plus 

Query 2043 GGGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 12189336 GGGCTTCCTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTG 12189375 



Features in this part of subject sequence: 

eukaryotic translation initration factor 3, subunit 10 th. . . 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 30/34 (88%), Gaps = 1/34 (2%) 
Strand=Plus /Minus 

Query 2049 CCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I III M I I I I I I I I I II 
Sbjct 39551677 CCTGTG-TTGTTGTTTTTGTTGTTGTTGTTGTTG 39551645 



>ref I NW_001838769 . 1 1 Hs2_WGA162_36 ^ Homo sapiens chromosome 2 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188433 ) 

Length=57109151 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
neurexin 1 isoform alpha precursor 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 26/27 (96%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2 053 TGATTGTTGTTATTGATGTTGTTGTTG 2079 

I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 20778224 TGATTGTTGTTATTGTTGTTGTTGTTG 20778198 



Features in this part of subject sequence: 
hypothetical protein 



Score =42.8 bits (46), Expect =2.2 
Identities = 66/91 (72%), Gaps = 9/91 (9%) 
Strand=Plus/Plus 

Query 1277 GGGCGGCGTGGGCGCGGGAGCCGAC— CCCAGCGACCCCCTGGGCCTGTACGAGGAGGAG 1334 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 7734878 GGGCGGCGCGGG-GCGGG-GCCGGCGGCGCGGGGACTCACTGGGGCGG GGCGGAG 7734930 

Query 1335 GGCGCGGGCTGCTCGCCCGACGCGCCCAGCC 1365 

I I I I I I I I I II I I I I I I I I I I I I I I 
Sbjct 7734931 GGCGCGGGCAGCGGGGCCGACCCTCCCAGCC 7734961 
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Features in this part of subject sequence: 
sterolin 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 13885678 TTGTTGTTATTGTTGTTGTTGTTGTTG 13885704 



Features flanking this part of subject sequence: 
23818 bp at 5' side: M-phase phosphoprotein 10 

14630 bp at 3 ' side: poly (A) binding protein interacting protein 2B 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 41233894 TTGTTGTTATTGTTGTTGTTGTTGCTG 41233868 



Features flanking this part of subject sequence: 
16873 bp at 5 ' side: STAM binding protein 
23234 bp at 3' side: actin, gamma 2 propeptide 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGA 2080 

I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 43937347 TTGTTGTTATTGTTGTTGTTGTTGA 43937323 



>ref I NW_001838115 . 2 I Hsl4_WGA824_36 S Homo sapiens chromosome 14 genomic contig, alternate as 

(based on HuRef SCAF_1103279188327) 

Length=8970161 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
REST corepressor 1 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 35/42 (83%), Gaps = 0/42 (0%) 
Strand=Plus /Minus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 1645733 CGCCTCGGGCGCCGCCGCCTCCTCAGCCTCGGCCGCCGCCGC 1645692 



Features flanking this part of subject sequence: 

73361 bp at 5' side: SET domain containing 3 isoform b 
65304 bp at 3 ' side: B-cell CLL/lymphoma IIB isoform 2 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGA 2080 

II I I I I I I I I I I I I II II II II M 
Sbjct 4895427 TTGTTGTTATTGTTGTTGTTGTTGA 4895403 



>ref I NW_001838461 . 1 1 Hsl8_WGA1170_36 & Homo sapiens chromosome 18 genomic contig, alternate a 

(based on HuRef SCAF_1103279188344) 

Length=10560353 

Features flanking this part of subject sequence: 
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52573 bp at 5' side: zinc finger protein 161 homolog 

48166 bp at 3' side: erythrocyte membrane protein band 4.1-like 3 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 29/32 (90%), Gaps = 0/32 (0%) 
Strand=Plus /Minus 

Query 2050 CTGTGATTGTTGTTATTGATGTTGTTGTTGAT 2081 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 5299965 CTGTTGTTGTTGTTATTGTTGTTGTTGTTGAT 5299934 



>ref I NW_001838768 . 1 I Hs2_WGA161_36 ktl Homo sapiens chromosome 2 genomic contig, alternate asse 
(based on HuRef SCAF_1103279188306) 
Length=8993619 



Features flanking this part of subject sequence: 
9750 bp at 5' side: similar to hCG1783917 

35597 bp at 3' side: ATPase family, AAA domain containing 2B 



Score =44.6 bits (48), Expect =0.62 
Identities = 35/42 (83%), Gaps = 0/42 (0%) 
Strand=Plus/Plus 

Query 2041 AGGGGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 20 82 

III I I I I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 2769192 AGGTGGGACCTTTCATTGTTGTTGTTGTTGTTGTTGTTGTTG 2769233 



>ref I NW_0018389B7 . 1 I Hs6_WGA380_36 Homo sapiens chromosome 6 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188274B) 

Length=25130961 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 

673617 bp at 5' side: similar to OTTHUMP00000016822 
14999 bp at 3' side: similar to hCG1652647 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 29/32 (90%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAA 2087 

I I I I I I I I I I I I I I I I I I I I I I I III III 
Sbjct 16521782 TTGTTGTTATTGTTGTTGTTGTTGTTGGAAAA 16521813 



Features flanking this part of subject sequence: 
877536 bp at 5' side: hypothetical protein 
56773 bp at 3' side: similar to PR02122 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 

Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I II II 
Sbjct 11600965 TTGTTGTTATTGTTGTTGTTGTTGTTG 11600939 



>ref I NW_001838928 . 2 I Hs5_WGA321_36 Homo sapiens chromosome 5 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188369 ) 

Length=3866709 

Features flanking this part of subject sequence: 

318147 bp at 5' side: cadherin 18, type 2 preproprotein 
1493882 bp at 3' side: hypothetical protein 



Score =44.6 bits (48), Expect =0.62 
Identities = 26/27 (96%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 
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Sbjct 2312131 

>ref I NW_001838915 . 1 I Hs4_WGA308_36 l3 Homo sapiens chromosome 4 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188399 ) 

Length=43867763 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 

calcium/calmodulin-dependent protein kinase II delta isof . . . 
calcium/calmodulin-dependent protein kinase II delta isof... 

Score = 44.6 bits (48), Expect = 0.62 
Identities = 29/32 (90%), Gaps = 0/32 (0%) 
Strand=Plus /Minus 

Query 2 051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

III I I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 38913468 TGTTATTGTTGTTATTGTTGTTGTTGTTGTTG 38913437 

Features flanking this part of subject sequence: 

133705 bp at 5' side: protein kinase, cGMP-dependent, type II 
89301 bp at 3 ' side: RasGEF domain family, member IB 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2 051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 6765249 TGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 6765280 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2053 TGATTGTTGTTATTGATGTTGTTGTTG 2079 

III I II II II II II I I I I I I I I I I I 
Sbjct 12614178 TGAGTGTTGTTATTGTTGTTGTTGTTG 12614204 

Features flanking this part of subject sequence: 

29889 bp at 5' side: protein phosphatase IK (PP2C domain containing) 
70965 bp at 3' side: hect domain and RLD 6 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus/Plus 

Query 2055 ATTGTTGTTATTGATGTTGTTGTTG 2079 

I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 13742614 ATTGTTGTTATTGTTGTTGTTGTTG 13742638 

Features flanking this part of subject sequence: 
278276 bp at 5' side: hypothetical protein 
1070253 bp at 3 ' side: hypothetical protein LOC132720 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus /Minus 

Query 2055 ATTGTTGTTATTGATGTTGTTGTTG 2079 

I I I I I I I I I I I I I I I I I II I I I I I 
Sbjct 36489587 ATTGTTGTTATTGATGCTGTTGTTG 36489563 

Features flanking this part of subject sequence: 
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829674 bp at 5' side: translocation associated membrane protein 1-like 1 
140202 bp at 3' side: N-deacetylase/N-sulfotransf erase (heparan glucosaminyl ) 3 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 27/30 (90%), Gaps = 0/30 (0%) 
Strand=Plus/Plus 

Query 2053 TGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

II I I I I I I I I I I I I I I I I I I I M I I II 
Sbjct 43331849 TGTTTGTTGTTATTGTTGTTGTTGTTGCTG 43331878 



>ref I NW_001838878 . 1 1 Hs3_WGA271_36 Homo sapiens chromosome 3 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188187) 

Length=20655341 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
roundabout 1 isoform a 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 32/37 (86%), Gaps = 0/37 (0%) 
Strand=Plus/Plus 

Query 2 0 46 CGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

III I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 12950732 CGAACTGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 12950768 



Features flanking this part of subject sequence: 
605382 bp at 5' side: roundabout 1 isoform b 
1368845 bp at 3' side: hypothetical protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 27/30 (90%), Gaps = 0/30 (0%) 
Strand=Plus /Minus 

Query 2 056 TTGTTGTTATTGATGTTGTTGTTGATGGCA 2085 

II M M I I III I I I I I I I I I I I I I I II 
Sbjct 13425064 TTGTTGTTGTTGTTGTTGTTGTTGATGCCA 13425035 



>ref I NW_001838953 . 2 I Hs5_WGA346_36 k& Homo sapiens chromosome 5 genomic contig, alternate asse 

(based on HuRef SCAF_1103279188146 ) 

Length=16305437 

Features in this part of subject sequence: 
transmembrane protein 173 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 30/34 (88%), Gaps = 0/34 (0%) 
Strand=Plus/Plus 

Query 2049 CCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 16280531 CCTGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 16280564 



>ref I NT_026437 . 11 1 Hsl4_26604 S Homo sapiens chromosome 14 genomic contig, reference assembly 
Length=88290585 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
REST corepressor 1 



Score =44.6 bits (48), Expect =0.62 
Identities = 35/42 (83%), Gaps = 0/42 (0%) 
Strand=Plus/Plus 

Query 1148 CGCCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 
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Sbjct 84059111 

Features flanking this part of subject sequence: 

3679 bp at 5' side: olfactorY receptor, family 11, subfamily H, member 1 
171138 bp at 3' side: protein expressed in prostate, ovary, testis, and placent. 

Score =41.0 bits (44), Expect =7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 382279 TTGTTGTTGTTGATGTTGTTGTTGTTG 382253 

Features flanking this part of subject sequence: 
44156 bp at 5' side: hypothetical protein 

412677 bp at 3' side: olfactory receptor, family 4, subfamily Q, member 3 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 802724 TTGTTGTTGTTGATGTTGTTGTTGTTG 802750 



Features flanking this part of subject sequence: 
418658 bp at 5 ' side: hypothetical protein 

38175 bp at 3' side: olfactory receptor, family 4, subfamily Q, member 3 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 1177226 TTGTTGTTGTTGATGTTGTTGTTGTTG 1177252 



Features flanking this part of subject sequence: 
1557415 bp at 5' side: MAM domain containing 1 
886142 bp at 3' side: ribosomal protein S29 isoform 2 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 30158138 TTGTTGTTGTTGATGTTGTTGTTGTTG 30158112 



Features flanking this part of subject sequence: 
50279 bp at 5' side: fermitin family homolog 2 
45895 bp at 3 ' side: DDHD domain containing 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
St rand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 34467341 TTGTTGTTGTTGTTGTTGTTGTTGATG 34467315 



>ref I NT_016354 . 18 I Hs4_16510 ^ Homo sapiens chromosome 4 genomic contig, reference assembly 
Length=92123751 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
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calcium/calmodulin-dependent protein kinase II delta isof... 
calcium/calmodulin-dependent protein kinase II delta isof... 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 29/32 (90%), Gaps = 0/32 (0%) 
Strand=Plus /Minus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

III I I I I I I I I I I I I I I I I I I I I II II II 
Sbjct 38971724 TGTTATTGTTGTTATTGTTGTTGTTGTTGTTG 38971693 



Features flanking this part of subject sequence: 

133593 bp at 5' side: protein kinase, cGMP-dependent, type II 
89097 bp at 3 ' side: RasGEF domain family, member IB 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 6807515 TGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 6807546 



Features in this part of subject sequence: 
kelch-like 8 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2053 TGATTGTTGTTATTGATGTTGTTGTTG 2079 

III I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 12656097 TGAGTGTTGTTATTGTTGTTGTTGTTG 12656123 



Features flanking this part of subject sequence: 

29879 bp at 5' side: protein phosphatase IK (PP2C domain containing) 
70436 bp at 3' side: hect domain and RLD 6 



Score =41.0 bits (44), Expect =7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus/Plus 

Query 2055 ATTGTTGTTATTGATGTTGTTGTTG 
I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 13777335 ATTGTTGTTATTGTTGTTGTTGTTG 



Features in this part of subject sequence: 
hypothetical protein LOC54848 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAA 2087 

II I I I I I III I I I I I I I I I I I I I I I I I I 
Sbjct 31046853 TTTTTGTTGTTGTTGTTGTTGTTGTTGGCAAA 31046884 



Features flanking this part of subject sequence: 
278486 bp at 5' side: hypothetical protein 
1070062 bp at 3' side: hypothetical protein LOC132720 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 24/25 (96%), Gaps = 0/25 (0%) 
Strand=Plus /Minus 

Query 2055 ATTGTTGTTATTGATGTTGTTGTTG 2079 

II II M M I I I I I I I I I I I I I I I I 
Sbjct 36544821 ATTGTTGTTATTGATGCTGTTGTTG 36544797 



Features in this part of subject sequence: 
ankyrin 2 isoform 1 



2079 

13777359 
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Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 

Sbjct 38574052 TTGTTGTTATTGTTGTTGTTGTTGTTG 38574026 



Features flanking this part of subject sequence: 

828203 bp at 5' side: translocation associated membrane protein 1-like 1 
140285 bp at 3' side: N-deacetylase/N-sulfotransf erase (heparan glucosaminyl ) 3 



Score =41.0 bits (44), Expect =7.5 
Identities = 27/30 (90%), Gaps = 0/30 (0%) 
Strand=Plus/Plus 

Query 2053 TGATTGTTGTTATTGATGTTGTTGTTGATG 
II I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 43382897 TGTTTGTTGTTATTGTTGTTGTTGTTGCTG 



Features flanking this part of subject sequence: 
859109 bp at 5' side: hypothetical protein 
1066931 bp at 3 ' side: hypothetical protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 56126973 TTGTTGTTGTTGTTGTTGTTGTTGATG 56126947 



Features flanking this part of subject sequence: 

323925 bp at 5' side: protocadherin 10 isoform 2 precursor 

1467518 bp at 3' side: similar to Ubiquinol-cytochrome c reductase, complex III ... 



Score =41.0 bits (44), Expect =7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus /Minus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I III III I I I I I I I II 
Sbjct 58946089 TGTGATTGTTGTTGTTGTTGTCGTTGTTGTTG 58946058 



Features in this part of subject sequence: 

similar to FRASl-related extracellular matrix protein 3 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 69094402 TTGTTGTTGTTGATGTTGTTGTTGTTG 69094376 



Features flanking this part of subject sequence: 
39101 bp at 5' side: peptidylprolyl isomerase D 
6793 bp at 3' side: hypothetical protein LOC57600 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus /Minus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I III II I I I I I I I I I II 
Sbjct 84231719 TGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 84231688 



Features flanking this part of subject sequence: 

462638 bp at 5' side: membrane-associated RING-CH protein I 



2082 

43382926 
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120826 bp at 3' side: acidic nuclear phosphoprotein 32C 
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Score = 41.0 bits (44), Expect = 7.5 
Identities = 33/40 (82%), Gaps = 0/40 (0%) 
Strand=Plus /Minus 

Query 2043 GGGCGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I II III I I I I I I I I III I I I I I I I I I I I II 
Sbjct 89545480 GGGCTACTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTG 89545441 

>ref I NT_023935 . 17 I Hs9_24091 18 Homo sapiens chromosome 9 genomic contig, reference assembly 

Length=21507948 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features flanking this part of subject sequence: 

383166 bp at 5' side: transducin-like enhancer protein 4 
1475853 bp at 3' side: transducin-like enhancer protein 1 

Score = 44.6 bits (48), Expect = 0.62 
Identities = 33/38 (86%), Gaps = 2/38 (5%) 
Strand=Plus/Plus 

Query 2059 TTGTTATTGATGTTGTTGTTGATGGC<H5aaaaa,-::.-:;.aaG 2096 

I I I I I III II II II II II I I I I I I I I I I I I I I I 
Sbjct 11887757 TTGTTGTTGTTGTTGTTGTTGTT — CAAAAAAAAAAAG 11887792 

Features flanking this part of subject sequence: 
534 bp at 5' side: growth arrest-specific 1 
164983 bp at 3' side: similar to hCG1817457 

Score =44.6 bits (48), Expect =0.62 
Identities = 49/64 (76%), Gaps = 5/64 (7%) 
Strand=Plus/Plus 

Query 1133 GGCGCGGACTCCCAGCG CCTCGGCCTCCGCCTCCTCGGCAGCCTCGGCCTCCGCAGC 1189 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 18726760 GGCCCGGCCGCCCCGCGTCCCCTCCCCCTCCGCC — CGCGGCAACCCCGGCCTCCGCAGA 18726817 

Query 1190 GOTO 1193 

I I I I 

Sbjct 18726818 GCTC 18726821 

Features flanking this part of subject sequence: 

705126 bp at 5' side: transducin-like enhancer protein 4 
1153902 bp at 3' side: transducin-like enhancer protein 1 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I III I I I I I I I I I I I I I I 
Sbjct 12209743 TTGTTGTTGTTGTTGTTGTTGTTGATG 12209717 

>ref I NT_008470 . 18 I Hs9_8627 S Homo sapiens chromosome 9 genomic contig, reference assembly 
Length=40394265 

Sort alignments for this subject seq 
E value Score Percent identity 
Query start position Subject star 

Features in this part of subject sequence: 
KIAA0368 protein 

Score = 44.6 bits (48), Expect = 0.62 
Identities = 27/29 (93%), Gaps = 0/29 (0%) 
Strand=Plus/Plus 

Query 1000 
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Sbjct 21566991 AGCTCATCCTGGGAGGCGGAGGCGGCGGC 21567019 
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Features in this part of subject sequence: 
DAB2 interacting protein isoform 1 



Score =42.8 bits (46), Expect =2.2 
Identities = 39/49 (79%), Gaps = 3/49 (6%) 
Strand=Plus/Plus 

Query 298 CGGCGGCCGCC GCGAGGGTGTGAGCGCGCGTGGGCGCCCGCCGAGC 343 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 31735142 CGGCGGCCGCTCGGGCGAGCGCGGGAGAACGCGTGGGCGCCCGCCGGGC 31735190 



Features in this part of subject sequence: 
paralemmin 2 isoform b 
paralemmin 2 isoform a 



Score =41.0 bits (44), Expect =7.5 
Identities = 34/41 (82%), Gaps = 2/41 (4%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCaaas:.:u;aaaaaG 2096 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Sbjct 19981612 TTGTTGTTATTGTTGTTGTTGTCGTT — TAAGAAAAAAAAG 19981650 



Features in this part of subject sequence: 

Ral GEF with PH domain and SH3 binding motif 1 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 30/35 (85%), Gaps = 0/35 (0%) 
Strand=Plus/Plus 



Features flanking this part of subject sequence: 
33219 bp at 5' side: roundabout 1 isoform a 
1369909 bp at 3' side: hypothetical protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 27/30 (90%), Gaps = 0/30 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCA 
II I I I I I I III I I I I I I I I I I I I I I II 
Sbjct 13379166 TTGTTGTTGTTGTTGTTGTTGTTGATGCCA 



eq 
ar 



2085 

13379137 



Query 2048 ACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 37143991 ACCTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTG 37144025 



>ref I NT_022459 . 14 I Hs3_22615 S Homo sapiens chromosome 3 genomic contig, reference assembly 
Length=24211711 

Sort alignments for this subject s 
E value Score Percent identity 
Query start position Subject st 

Features in this part of subject sequence: 
roundabout 1 isoform a 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 32/37 (86%), Gaps = 0/37 (0%) 
Strand=Plus/Plus 

Query 2046 CGACCTGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

III I I I I I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 12905511 CGAACTGTGTTTGTTGTTGTTGTTGTTGTTGTTGTTG 12905547 



>ref I NT_032977 . 8 I Hsl_33153 Homo sapiens chromosome 1 genomic contig, reference assembly 
Length=73835825 

Sort alignments for this subject seq 
E value Score Percent identity 
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Query start position Subject star 

Features flanking this part of subject sequence: 

216269 bp at 5' side: leucine rich repeat containing 8 family, member C 
1792 bp at 3' side: leucine rich repeat containing 8 family, member D 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 26/27 (96%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2768 GGTTTCGGaaaa-aaaaaaag-uaaaaaa 2794 

I I I I I I I I I I I I I I I I I I I I I I I I I I 

Sbjct 60368729 GGTTTCGGAAAAAAAAAAAAAAAAAAA 60368755 



Features flanking this part of subject sequence: 

30984 bp at 5' side: Cbp/p3 0 0-interacting transactivator , with Glu/Asp-: 
90153 bp at 3' side: CTP synthase 

Score =42.8 bits (46), Expect =2.2 
Identities = 26/28 (92%), Gaps = 0/28 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGG 2083 

I I I I I I I I III I I I I I I I I I I I I I I I 
Sbjct 11330728 TTGTTGTTGTTGTTGTTGTTGTTGATGG 11330701 



Features flanking this part of subject sequence: 

64368 bp at 5' side: discs, large (Drosophila) homolog-associated protein 3 
6709 bp at 3 ' side: hypothetical protein 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
Strand=Plus/Plus 

Query 2051 TGTGATTGTTGTTATTGATGTTGTTGTTGATG 2082 

III I I I I I I I I I III I I I I I I I I I I I II 
Sbjct 5407270 TGTTATTGTTGTTGTTGCTGTTGTTGTTGTTG 5407301 



Features in this part of subject sequence: 

eukaryotic translation initiation factor 2B, subunit 3 gamma 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 32/36 (88%), Gaps = 2/36 (5%) 
Strand=Plus /Minus 

Query 2244 TTGTTTTATTTTG-CTTCTTGGTCAAGAAAGGAGGG 2278 

I I I I I I II II I II I I I I I I I I I I I I I I I I I I I 
Sbjct 15312647 TTGTTTTATTTTGTTTTCTTGG-CAAGAGAGGAGGG 15312613 



Features in this part of subject sequence: 
ubiquitin specific protease 24 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 28/32 (87%), Gaps = 0/32 (0%) 
St rand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATGGCAAA 2087 

I I I I I I I I III I I I I I I I I I I I II I I I I 
Sbjct 25513245 TTGTTGTTGTTGTTGTTGTTGTTGTTGTCAAA 25513214 



Features in this part of subject sequence: 
DnaJ (Hsp40) homolog, subfamily C, member 6 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 36/44 (81%), Gaps = 1/44 (2%) 
Strand=Plus/Plus 

Query 2049 CCTGTGATTGTTGTTATTGATGTTGTTGTTGATGGCAAAAAAAA 2092 

I I I I I I I II II III I I I I I I I I I I I III I I I I I I I I 
Sbjct 35774590 CCTGTG-TAATTTTTGTTGTTGTTGTTGTTGTTGGGAAAAAAAA 35774632 
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Features in this part of subject sequence: 
adenylate kinase 5 isoform 1 
adenylate kinase 5 isoform 2 

Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2053 TGATTGTTGTTATTGATGTTGTTGTTG 2079 

II I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 47884298 TGTTTGTTGTTATTGTTGTTGTTGTTG 47884324 



Features flanking this part of subject sequence: 

20976 bp at 5' side: guanylate binding protein 1, interferon-inducible, 67kD 
23939 bp at 3 ' side: guanylate binding protein 2, interferon-inducible 



Score = 41.0 bits (44), Expect = 7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus /Minus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 59521838 TTGTTGTTATTGTTGTTGTTGTTGTTG 59521812 



Features flanking this part of subject sequence: 
1219349 bp at 5' side: similar to hCG2040669 

184274 bp at 3' side: polypyrimidine tract binding protein 2 



Score =41.0 bits (44), Expect =7.5 
Identities = 25/27 (92%), Gaps = 0/27 (0%) 
Strand=Plus/Plus 

Query 2056 TTGTTGTTATTGATGTTGTTGTTGATG 2082 

I I I I I I I I I I I I I I I I I I I I I I I II 
Sbjct 66975040 TTGTTGTTATTGTTGTTGTTGTTGTTG 66975066 



>ref I NT_010859 . 14 I Hsl8_11016 Homo sapiens chromosome 18 genomic contig, reference assembly 
Length=15400898 

Features flanking this part of subject sequence: 
37804 bp at 5' side: hypothetical protein 

48190 bp at 3' side: erythrocyte membrane protein band 4.1-like 3 



Score = 44.6 bits (48), Expect = 0.62 
Identities = 29/32 (90%), Gaps = 0/32 (0%) 
St rand=Plus /Minus 

Query 2050 CTGTGATTGTTGTTATTGATGTTGTTGTTGAT 2081 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Sbjct 5336492 CTGTTGTTGTTGTTATTGTTGTTGTTGTTGAT 5336461 



Database: human build 36.3 reference assembly genomic scaffolds 

Posted date: Apr 16, 2008 7:40 PM 
Number of letters in database: 1,523,044,440 
Number of sequences in database: 49,942 



Lambda K H 

0.634 0.408 0.912 

Gapped 

Lambda K H 

0.634 0.408 0.912 
Matrix: blastn matrix: 2 -3 
Gap Penalties: Existence: 5, Extension: 
Number of Sequences: 49942 
Number of Hits to DB: 5820376 
Number of extensions: 423311 
Number of successful extensions: 1102 
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Number of sequences better than 10: 52 

Number of HSP ' s better than 10 without gapping: 0 

Number of HSP ' s gapped: 1085 

Number of HSP ' s successfully gapped: 58 

Length of query: 2797 

Length of database: 5818011736 

Length adjustment: 35 

Effective length of query: 2762 

Effective length of database: 5816263766 

Effective search space: 16064520521692 

Effective search space used: 16064520521692 

A: 0 

XI: 22 (20.1 bits) 
X2: 33 (29.8 bits) 
X3: 110 (99.2 bits) 
31: 36 (33.7 bits) 
S2: 44 (41.0 bits) 
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